AMERICAN 
JOURNAL OF MATHEMATICS 


FOUNDED BY THE JOHNS HOPKINS UNIVERSITY


EDITED BY 


E. T. BELL ABRAHAM COHEN
CALIFORNIA INSTITUTE OF TECHNOLOGY THE JOHNS HOPKINS UNIVERSITY


T. H. HILDEBRANDT F. D. MURNAGHAN
UNIVERSITY OF MICHIGAN THE JOHNS HOPKINS UNIVERSITY


J. F. RITT
COLUMBIA UNIVERSITY


WITH THE COOPERATION OF 


MARSTON MORSE G. C. EVANS OYSTEIN ORE 
E. P. LANE AUREL WINTNER H. P. ROBERTSON 


ALONZO CHURCH GABRIEL SZEGO M. H. STONE 
L. R. FORD R. L. WILDER T. Y. THOMAS 
OSCAR ZARISKI R. D. JAMES G. T. WHYBURN 


PUBLISHED UNDER THE JOINT AUSPICES OF 


THE JOHNS HOPKINS UNIVERSITY 
AND 


THE AMERICAN MATHEMATICAL SOCIETY 


VOLUME LX 
1938 


THE JOHNS HOPKINS PRESS 
BALTIMORE, MARYLAND 
U. S. A. 




AMERICAN 
JOURNAL OF MATHEMATICS 


FOUNDED BY THE JOHNS HOPKINS UNIVERSITY 


EDITED BY 


E. T. BELL ABRAHAM COHEN 
CALIFORNIA INSTITUTE OF TECHNOLOGY THE JOHNS HOPKINS UNIVERSITY 


T. H. HILDEBRANDT F. D. MURNAGHAN 
UNIVERSITY OF MICHIGAN THE JOHNS HOPKINS UNIVERSITY 


J. F. RITT
COLUMBIA UNIVERSITY 
WITH THE COOPERATION OF 


MARSTON MORSE G. C. EVANS OYSTEIN ORE 

E. P. LANE AUREL WINTNER H. P. ROBERTSON 
ALONZO CHURCH GABRIEL SZEGO M. H. STONE 

L. R. FORD R. L. WILDER T. Y. THOMAS 
OSCAR ZARISKI R. D. JAMES G. T. WHYBURN 


PUBLISHED UNDER THE JOINT AUSPICES OF 


THE JOHNS HOPKINS UNIVERSITY 
AND 


THE AMERICAN MATHEMATICAL SOCIETY 


Volume LX, Number 1 
JANUARY, 1938 


THE JOHNS HOPKINS PRESS 
BALTIMORE, MARYLAND 
U. S. A. 




CONTENTS


On certain points in the theory of algebraic differential equations. By J. F. RITT.

The analysis of the direct product of irreducible representations of the symmetric groups. By F. D. MURNAGHAN.

On a class of arithmetical Fourier series. By PHILIP HARTMAN.

The structure of local class field theory. By O. F. G. SCHILLING.

Groups whose commutator subgroups are of order two. By G. A. MILLER.

Convergence of a sequence of linear transformations. By M. H. In-

The interrelations of the fundamental solutions of the hypergeometric equation; logarithmic case. By LYLE E. MEHLENBACHER.

The theorems of Gauss-Bonnet and Stokes. By E. R. VAN KAMPEN.

Homomorphism of rings and fields of point sets. By Morris Kunz.

Polynomial ideals defined by infinitely near base points. By Oscar

Integral forms and variational orthogonality. By PHILIP HARTMAN and

On translations in general plane geometries. By HERBERT BUSEMANN.

THE AMERICAN JOURNAL OF MATHEMATICS will appear four times yearly.

The subscription price of the JOURNAL for the current volume is $7.50 (foreign postage 50 cents); single numbers $2.00.

A few complete sets of the JOURNAL remain on sale.

Papers intended for publication in the JOURNAL may be sent to any of the Editors.

Editorial communications may be sent to Dr. A. COHEN at The Johns Hopkins University.

Subscriptions to the JOURNAL and all business communications should be sent to THE JOHNS HOPKINS PRESS, BALTIMORE, MARYLAND, U. S. A.


Entered as second-class matter at the Baltimore, Maryland, Postoffice, acceptance for mailing at special 
rate of postage provided for in Section 1103, Act of October 8, 1917, Authorized on July 3, 1918. 


PRINTED IN THE UNITED STATES OF AMERICA 
BY J. H. FURST COMPANY, BALTIMORE, MARYLAND 




FRANK MORLEY 
1860-1937 




EDITOR 
1900-1937 


ON CERTAIN POINTS IN THE THEORY OF ALGEBRAIC 
DIFFERENTIAL EQUATIONS.* 


By J. F. RITT.


The topics considered in this paper are: 


I. Forms in several unknowns. 
II. Pairs of forms. 
III. Essential manifolds composed of one solution.
IV. An approximation theorem. 
V. Essential irreducible manifolds in the manifold of a form. 
VI. Equations in two unknowns, of the first order. 


The results, under each topic, are described at the head of the section dealing with that topic.


I. Forms in Several Unknowns. 
1. Let us consider an algebraically irreducible form A in the unknowns $y_1, \cdots, y_n$. The general solution of A has $n - 1$ arbitrary unknowns.¹ It is natural to inquire as to the possibilities for the number of arbitrary unknowns in the other essential irreducible manifolds in the manifold of A. This inquiry is answered by the following theorem:


Given a non-zero form² A in $y_1, \cdots, y_n$, every essential irreducible manifold in the manifold of A has $n - 1$ arbitrary unknowns.


In other words, every essential irreducible manifold in the manifold of A is the general solution of a form in $y_1, \cdots, y_n$.
2. It will evidently suffice to prove that every solution

(1) $y_i = \eta_i \quad (i = 1, \cdots, n)$

of A is contained in an irreducible manifold, held by A, which has $n - 1$ arbitrary unknowns. We wish to show that we may restrict our examination to the case of $\eta_i = 0$, $i = 1, \cdots, n$. Let the $\eta_i$ in a given solution (1) be


* Received July 27, 1937.
¹ A set of arbitrary unknowns of a system of forms will be called a set of arbitrary unknowns for the manifold of the system.
² Algebraic irreducibility is not necessary.




adjoined to the underlying field.³ Under the substitution $y_i = z_i + \eta_i$, A goes over into a form A′ in $z_1, \cdots, z_n$. To prove that $z_i = 0$, $i = 1, \cdots, n$, belongs to an irreducible manifold with $n - 1$ arbitrary unknowns, held by A′, will be to show that (1) is embedded as described above. Accordingly we assume in what follows that A has the solution $y_i = 0$, $i = 1, \cdots, n$, and we limit ourselves to the study of that solution.


3. Let B represent the sum of the terms of lowest degree in A considered as a polynomial in the $y_{ij}$. Changing the notation if necessary, we assume that B effectively involves one or more $y_{1j}$.

Let $v_2, \cdots, v_n$ be functions of x with a common domain of analyticity, which, when substituted respectively for $y_2, \cdots, y_n$ in B, reduce B to a form C in $y_1$ which involves one or more $y_{1j}$ effectively.

Representing by c an arbitrary constant, we put, in A,


(2) $y_i = v_i c \quad (i = 2, \cdots, n).$


Then A goes over into an expression D in $x$, $c$, $y_1$.
We shall prove the existence of a formal power series 


described as in § 11 of our paper, On the singular solutions of algebraic differential equations,⁴ which causes D to vanish identically in x and c when substituted for $y_1$ in D.


4. For $\phi_1$, we take any function which annuls C above when substituted for $y_1$ in C. If D vanishes for $y_1 = \phi_1 c$, $\phi_1 c$ can be used for (3). In what follows, we assume that such vanishing does not occur.


5. In D, we put $y_1 = \phi_1 c + u_1$ where $u_1$ is a new unknown. Then D goes over into an expression H′ in $x$, $c$, $u_1$. Designating by m the order of D in $y_1$, we arrange H′ as a polynomial in $u_{10}, \cdots, u_{1m}$. We write

(4) $H' = a'(c) + \sum_i b'_i(c)\, u_{10}^{\alpha_{0i}} \cdots u_{1m}^{\alpha_{mi}}.$


Here $a'(c)$ and the $b'_i(c)$ are polynomials in c with coefficients which are functions of x. The terms in $\sum$ are those which are not free of $u_{10}, \cdots, u_{1m}$ and we understand that no $b'_i$ is zero. We know that $a'(c)$, which equals $D(\phi_1 c)$, is not zero. As to i, it ranges from unity to some positive integer.


³ If necessary, with a shrinking of the domain of meromorphicity for the field.
⁴ Annals of Mathematics, vol. 37 (1936), p. 552. Denoted below by S.S.




Let $\sigma'$ be the least exponent of c in $a'$ and $\sigma'_i$ the least exponent of c in $b'_i(c)$. Let

(5) $\rho_2 = \operatorname{Max}_i \dfrac{\sigma' - \sigma'_i}{\alpha_{0i} + \cdots + \alpha_{mi}}$

where i has the range which it has in $\sum$.

We shall prove that $\rho_2 > 1$.

Let r be the degree of B of § 3. Let us consider D as a polynomial in c and the $y_{1j}$. Then each term of D has a degree not less than r. There are terms of degree r, and their sum is obtained by making the substitution (2) in B. Because $C(\phi_1) = 0$, the sum just mentioned vanishes for $y_1 = \phi_1 c$. Thus, as $a'(c) = D(\phi_1 c)$, we have $\sigma' > r$.

Under the substitution $y_1 = \phi_1 c + u_1$, a term of D of degree s in c and the $y_{1j}$ produces a set of terms of degree s in c and the $u_{1j}$. In particular, the terms of degree r in D will contribute to H′ power products in c and the $u_{1j}$ effectively involving the $u_{1j}$ and of degree r. This means that, for certain i, we have, in (4),

$\sigma'_i + \alpha_{0i} + \cdots + \alpha_{mi} = r.$

For such i, we have, since $\sigma' > r$,

$\dfrac{\sigma' - \sigma'_i}{\alpha_{0i} + \cdots + \alpha_{mi}} > 1.$

This, by (5), shows that $\rho_2 > 1$.


6. We now take over §§ 12-15 of S.S., substituting $y_1$ for $u_1$ and the expression "solution of D of type (3)" for the expression "solution of H of type (15)." Using the argument of § 16 of S.S., we have the series (3) sought for D.


7. Let us suppose now that the solution $y_i = 0$, $i = 1, \cdots, n$, of A, is not contained in a manifold with $n - 1$ arbitrary unknowns. In a decomposition of A into closed essential irreducible systems, let $\Sigma_1, \cdots, \Sigma_t$ be those systems which admit the solution $y_i = 0$, $i = 1, \cdots, n$. Let $A_i$ be a non-zero form in $\Sigma_i$, $i = 1, \cdots, t$, involving only $y_2, \cdots, y_n$. Let $H = A_1 A_2 \cdots A_t$.

We choose $v_i$ as in § 3 which do not annul the sum of the terms of lowest degree in H. Then H does not vanish identically in x and c for $y_i = v_i c$, $i = 2, \cdots, n$. Thus, in the decomposition of A, there is some essential irreducible system distinct from $\Sigma_1, \cdots, \Sigma_t$ whose forms all vanish for $y_i = v_i c$, $i = 2, \cdots, n$ and for $y_1$ as in (3).⁵ Such a system must admit the solution $y_i = 0$, $i = 1, \cdots, n$. This completes the proof of the theorem stated in § 1.


⁵ Cf. S.S., § 17. Note that A vanishes for the indicated substitutions.




II. Pairs of Forms. 


8. We prove the following theorem. Let A and B be non-zero forms in $y_1, \cdots, y_n$. Let B hold A. Let $A_1$ be the sum of the terms of lowest degree in A considered as a polynomial in the $y_{ij}$ and let $B_1$ be the corresponding sum for B. Then $B_1$ holds $A_1$.

A similar result holds for the terms of highest degree.


9. Remark. The simplest case is that in which B is a linear combina- 
tion of A and of the derivatives of A. One might expect that B, would then 
be a linear combination of A, and of its derivatives. We shall show by means 
of an example that this need not be so. 

Let 


dA 


be forms in the unknown y. Then A; ~y,*, B, =B. If B, were a linear 
combination of A, and its derivatives, y*y2 would be such a linear combina- 
tion. If the weight of y; is defined as j, y*°y2 and y,? have weight 2 and the 
derivatives of y,? have weight in excess of 2. Thus, y*°y2 would have to be 
simply a multiple of y,?.. This proves our statement. From the expression 
of B in terms of A, one might now conjecture that some power of B, is a linear 
combination of A, and of the first derivative of A,. In that case, some power 
of y*y. would be such a linear combination. This is impossible, since y*y2 is 
not divisible by y;. Actually, the cube of B, is linear in A, and its first two 


derivatives. 


10. We enter into our proof. If $A_1$ is free of the unknowns, $B_1$ certainly holds $A_1$. In what follows, we assume that the terms of $A_1$ are of positive degree. Then A vanishes for $y_i = 0$, $i = 1, \cdots, n$.

We shall prove the permissibility of assuming that $A_1$ contains a term involving only the $y_{1j}$. Let $z_2, \cdots, z_n$ and $w_2, \cdots, w_n$ be new unknowns. Let $y_i$, for $i > 1$, be replaced in $A_1$ by $z_i + w_i$. Then $A_1$ goes over into a form C in $y_1$, the $z_i$ and $w_i$. C contains terms free of the $z_{ij}$; the sum D of such terms in C is found by substituting $w_i$ for $y_i$ in $A_1$ for $i > 1$. Let $t_2$ be an integer which exceeds the order of D in $y_1$. On putting $w_2 = y_{1 t_2}$ in D, we convert D into a non-zero form $D_1$ in $y_1, w_3, \cdots, w_n$. We now replace $w_3$ in $D_1$ by $y_{1 t_3}$, where $t_3$ exceeds the order of $D_1$ in $y_1$. Continuing, we find a substitution

(6) $y_i = z_i + y_{1 t_i} \quad (i = 2, \cdots, n)$




which converts $A_1$ into a form E in $y_1$ and the $z_i$, E possessing terms free of the $z_{ij}$. The terms of E will have the same degree as those of $A_1$.

The substitution (6) may be applied to A and B and will give a situation in which E takes the place of $A_1$. This proves the legitimacy of the assumption described above, and, in what follows, $A_1$ will be understood to have terms involving only the $y_{1j}$.

11. We now refer to § 3. Let


be any solution of $A_1$. By § 3, there is a series (3) such that A vanishes identically in x and c when $y_i$ is replaced by $v_i c$ for $i > 1$ and $y_1$ is replaced by (3). Some power of B is a linear combination of A and its derivatives. Thus, B must vanish for the above replacements. This means that (7) is a solution of $B_1$, so that our theorem is proved.


12. The case of the terms of highest degree, mentioned in § 8, is perhaps most conveniently handled as follows. Let $A_1$ and $B_1$ represent the sums of the terms of highest degree in A and B respectively. Using unknowns $u, z_1, \cdots, z_n$, we put, in A and B,


(8)

We have then

$A = C/u^m, \quad A_1 = C_1/u^m,$

$B = D/u^m, \quad B_1 = D_1/u^m$


with m a positive integer and $C$, $C_1$, $D$, $D_1$ forms in u and the $z_i$. $C_1$ and $D_1$ will be the sums of the terms of least degree in C and D respectively. Because B holds A, $uD$ certainly holds C. By what precedes, $uD_1$ holds $C_1$. Because every solution of $A_1$ yields solutions of $C_1$ with $u \neq 0$, $B_1$ holds $A_1$.


III. Essential Manifolds Composed of One Solution. 


13. We use the unknowns $y_1, \cdots, y_n$. We consider a system $\Sigma$ composed of n forms


(9) $y_i^{p_i} + F_i \quad (i = 1, \cdots, n)$


where, for each i, $p_i$ is a positive integer and $F_i$ a form which either is identically zero or else is composed of terms each of which is of total degree greater than $p_i$ in the $y_{jk}$, $k \geq 0$.

We shall prove that the solution of $\Sigma$ given by $y_i = 0$, $i = 1, \cdots, n$, is an essential irreducible manifold in the manifold of $\Sigma$.




14. We assume the statement to be false. Then the mentioned solution is contained in an irreducible manifold which is held by $\Sigma$ but which, for some i, is not held by $y_i$. Fixing our ideas, let us suppose that $y_1$ does not hold this manifold. Then there is a value a of x at which the coefficients in the $F_i$ are analytic such that, for every positive integer m and for every $\epsilon > 0$, $\Sigma$ has a solution, analytic at a, of the type


(10) $y_i = \sum_{j=0}^{\infty} b_{ij}(x - a)^j \quad (i = 1, \cdots, n),$

with

(11) $|b_{ij}| < \epsilon \quad (i = 1, \cdots, n;\ j = 0, \cdots, m),$


and with $b_{10} \neq 0$.

We represent by s and c respectively two positive numbers which will be fixed later. Considering a definite solution (10) which corresponds to given m, $\epsilon$, we put

(12) $z = \dfrac{x - a}{c}.$


Then the $y_i$ in (10) become analytic functions of z for z small. Again, we let

(13) $w_i(z) = c^{-s} y_i(z) \quad (i = 1, \cdots, n),$

with z and x related as in (12).

Each equation $y_i^{p_i} + F_i = 0$ goes over into an equation

(14) $w_i^{p_i} + \sum_{j} a_j\, c^{\mu_j s - \nu_j} B_j = 0$


where the B are power products in the $w_k$ and their derivatives with respect to z, the $\mu$ positive integers and the $\nu$ non-negative integers. Each $a_j$ is the coefficient in $F_i$ of the power product which produces $B_j$ and we regard the $a_j$, for any c, as functions of z. It is unnecessary to express the dependence on i of the quantities in (14).

If s is large, then, for every i, the $\mu_j s - \nu_j$ in (14) will all be positive. We fix s at a value large enough for this to be realized. Of course, s is independent of m, $\epsilon$ and the solution (10).
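
The exponents in (14) can be traced by a short computation. The sketch below assumes that (12) reads $z = (x - a)/c$ and that (13) reads $w_i = c^{-s} y_i$; the letters $d$ (total degree of a term of $F_i$) and $\nu$ (sum of the orders of the derivatives in that term) are introduced here only for illustration and do not appear in the text.

```latex
% Assumed forms of (12) and (13):  x - a = c z,   y_k = c^{s} w_k(z).
% Each differentiation in x costs one factor 1/c:
\[
  \frac{d^{l} y_k}{dx^{l}} = c^{\,s-l}\,\frac{d^{l} w_k}{dz^{l}},
  \qquad
  y_i^{p_i} = c^{\,s p_i}\, w_i^{p_i}.
\]
% A term of F_i of total degree d > p_i, whose derivative orders add up
% to \nu, therefore becomes  c^{sd - \nu} a B,  with B a power product
% in the w_k and their z-derivatives.
% Dividing  y_i^{p_i} + F_i = 0  by  c^{s p_i}:
\[
  w_i^{p_i} + \sum_j a_j\, c^{\,\mu_j s - \nu_j}\, B_j = 0,
  \qquad \mu_j = d_j - p_i \ge 1 .
\]
% Hence every exponent \mu_j s - \nu_j is positive as soon as
% s > \max_j \nu_j / \mu_j, a bound that depends only on the F_i.
```

Since the bound on s involves only the degrees and orders of the terms of the $F_i$, this is consistent with the remark that s is independent of m, $\epsilon$ and the chosen solution.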

We have, by (10),

(15) $w_i = \sum_{j=0}^{\infty} b_{ij}\, c^{\,j-s} z^j.$


We shall now fix c. Let v be the greatest integer less than s. We choose c in such a way that the greatest of the quantities

$\dfrac{|b_{ij}|}{c^{\,s-j}} \quad (i = 1, \cdots, n;\ j = 0, \cdots, v)$





equals unity. This is possible because $b_{10} \neq 0$. Then, if $m \geq v$ and if $\epsilon$ is small, c will be small.

When $1/m$ and $\epsilon$ decrease toward zero, the coefficient of $z^j$ in (15) for a fixed j exceeding v, and for any fixed i, tends toward zero. For, if $j > v$ then $j \geq s$.

It follows that, by decreasing $1/m$ and $\epsilon$, we can select a sequence of solutions (10) which yields, for every i, a sequence of $w_i$ which tends toward a polynomial which is either identically zero or else of degree v at most. The selection can be made in such a way that, for some i, the $w_i$ converge to a polynomial distinct from zero. Fixing our ideas, we assume that the sequence of $w_1$ tends toward a polynomial $\gamma(z)$ distinct from zero.

We now consider (14) for $i = 1$. When c is small and the $w_i$ are close to their polynomial limits, the expansion of $\sum$ in (14) in powers of z will begin with a large number of small coefficients. This contradicts the fact that $\gamma^{p_1} \neq 0$, so that our result is established.


IV. An Approximation Theorem. 


15. In A.D.E.,⁶ § 74, we established a result equivalent to the following: Let $\Sigma$ be a non-trivial closed irreducible system in $y_1, \cdots, y_n$. Let F be any form which does not hold $\Sigma$. Let $\zeta_1, \cdots, \zeta_n$ be any solution of $\Sigma$, analytic in some area $\mathfrak{A}$. There exists a set of points, dense in $\mathfrak{A}$, such that, given any point a of the set, any positive integer m and any $\epsilon > 0$, $\Sigma$ has a solution $\xi_1, \cdots, \xi_n$, analytic at a, which does not annul F at a, such that, for every i, each of the first $m + 1$ coefficients in the Taylor expansion of $\xi_i - \zeta_i$ at a is of modulus less than $\epsilon$.

We are going to derive here the following stronger conclusion from the hypothesis in the above theorem. There exists a set of points, residual⁷ in $\mathfrak{A}$, such that, given any point a of the set, any two positive integers r and m and any $\epsilon > 0$, $\Sigma$ has a solution $\xi_1, \cdots, \xi_n$, analytic at a, which does not annul F at a, such that, for every i, the r-th roots of $\xi_i - \zeta_i$ are analytic at a and have Taylor expansions at a in which the first $m + 1$ coefficients are all less than $\epsilon$ in modulus.


16. Remarks. The residual set of points a is not offered as one of the attractions of the above result. What is noteworthy is the use of the r-th roots. Residual sets occur in the proof, and nothing is lost in using such a set in the statement of the theorem. We note at this point that, in the approximation


⁶ A.D.E. will stand for our Colloquium Lectures.
⁷ The complement in $\mathfrak{A}$ of the sum of a countable number of sets each of which is nowhere dense on $\mathfrak{A}$.




theorem of A.D.E., the dense set of points a may be replaced immediately by a residual set. In short, for the m and $\epsilon$ on page 102 of A.D.E., one may use any point a of the area $\mathfrak{A}'$. Thus, the points a which may not be used for given m, $\epsilon$ are nowhere dense in $\mathfrak{A}$; the points a for which some impossible pair m, $\epsilon$ can be found form a set of the first category.

The stronger approximation theorem, which, as will be seen in § 26, is not without utility, is a first result in a program to perfect the approximation theorem of A.D.E. It would be natural to conjecture that $\zeta_1, \cdots, \zeta_n$ can be embedded analytically in a one-parameter family of solutions which do not annul F. That this is not so can be seen from § 90 of S.S., where the singular solution $y = 0$ of
(16) 


is discussed. That singular solution belongs to the general solution of (16). 
If (16) were satisfied, for every small h, by


$y = \phi(x, h)$


with $\phi$ analytic for x in some area and h small, and with $\phi$ vanishing identically in x for $h = 0$ but not vanishing identically in x and h, the first
member of (16) would have a convergent y-solution. However, the only 
y-solution which exists is divergent for every y0. 

Having disposed of the above conjecture, one might ask whether $\zeta_1, \cdots, \zeta_n$ cannot always be approximated uniformly in some area by solutions which do not annul F. Such a result would certainly not imply immediately the one which will be established here. For instance, as h approaches zero, $y = h^2 + hx$ approaches zero uniformly in any bounded area, but the coefficient of x in the expansion of $y^{1/2}$ at $x = 0$ does not tend toward zero with h.
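
The example can be checked directly. The following is a minimal computation for the square root ($r = 2$), assuming $h > 0$ and $|x| < h$ so that the binomial series converges:

```latex
\[
  \sqrt{h^{2} + hx}
  \;=\; h\Bigl(1 + \frac{x}{h}\Bigr)^{1/2}
  \;=\; h + \frac{x}{2} - \frac{x^{2}}{8h} + \cdots .
\]
% The coefficient of x equals 1/2 for every h, and the coefficient of
% x^2 is unbounded as h -> 0, although y = h^2 + hx itself tends to
% zero uniformly on every bounded set.
```

Uniform smallness of a solution therefore gives no control over the Taylor coefficients of its roots, which is the additional information the stronger theorem supplies.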


17. We shall show that it will suffice to consider the case in which the $\zeta_i$ are identically zero. We adjoin the n functions $\zeta_i$ to the underlying field, limiting the domain of x if necessary. $\Sigma$ becomes equivalent, for the enlarged field, to one or more essential irreducible systems. One of these systems, call it $\Sigma'$, will admit a sequence of solutions which do not annul F and which approach $\zeta_1, \cdots, \zeta_n$ in the manner described in A.D.E., § 74. Then $\zeta_1, \cdots, \zeta_n$ is a solution of $\Sigma'$ and F does not hold $\Sigma'$. Under the substitution $y_i = z_i + \zeta_i$, $\Sigma'$ goes over into an irreducible system $\Sigma''$ in the $z_i$; F goes over into a form G in the $z_i$. Then $z_i = 0$, $i = 1, \cdots, n$ will be a solution of $\Sigma''$ and G will not hold $\Sigma''$. This is enough for the justification of our statement.




18. Let P be a form in $y_1, \cdots, y_n$, given by

(17) $P = y_1^{p}(1 + A) + B$


where p is a positive integer, A a form vanishing for $y_i = 0$, $i = 1, \cdots, n$, and B a form which, when written as a polynomial in $y_1$ and its derivatives, has no term of degree less than $p + 1$.

It is easy to prove, subjecting x and $y_1$ to a Painlevé transformation, that the solution $y_i = 0$, $i = 1, \cdots, n$, of P is not contained in any irreducible manifold which is held by P but not by $y_1$.


19. In accordance with § 17, we assume that $\zeta_i = 0$, $i = 1, \cdots, n$.

Let $y_j$ be any one of the n unknowns. In the first stage of our proof we shall show that, given any positive integer r, there exists a dense set of values of x such that, given any a of the set and any m and $\epsilon$, $\Sigma$ has a solution $\xi_1, \cdots, \xi_n$ which does not annul F at a, the r-th roots of $\xi_j$ being analytic at a and the first $m + 1$ coefficients in the expansions at a of


$\xi_1, \cdots, \xi_{j-1}, \xi_j^{1/r}, \xi_{j+1}, \cdots, \xi_n$


all having moduli less than $\epsilon$.

If $y_j$ holds $\Sigma$, our result is seen directly to hold. In what follows we assume that $y_j$ does not hold $\Sigma$.


20. Let $y_1, \cdots, y_q$ be a set of arbitrary unknowns for $\Sigma$ (if such a set exists). Whether or not $y_j$ is among these arbitrary unknowns is of no importance. Let


(18) $A_{q+1}, \cdots, A_n$


be a basic set for $\Sigma$, $A_i$ introducing $y_i$. Let $S_i$ represent the separant of $A_i$ in (18).

No generality is lost in assuming that F of § 15 is divisible by $y_j$ and by each $S_i$. In what follows, we assume such divisibility.

Let the r of § 19 be given and let an m then be selected. We make the non-restrictive assumption that, for every i, m exceeds the order of F in $y_i$. We consider the forms in (18) and also the first m derivatives of each of them. We secure thus a set of $(m + 1)(n - q)$ forms which we shall now regard as simple forms in the $y_{ik}$. The set of simple forms thus obtained will be denoted by $\Phi$ and the unknowns in $\Phi$ will be taken as those $y_{ik}$ for which $k \leq r_i + m$ where $r_i$ is the highest of the orders in $y_i$ of the forms in (18).

Let $\Pi$ be the set of simple forms in the unknowns just described which vanish for all solutions of $\Phi$ which annul no $S_i$. It is easily proved that $\Pi$ is a prime system. (Cf. A.D.E., § 73.)




Because $\Sigma$ has the solution $y_i = 0$, $\Pi$ has a solution with every $y_{ik}$ zero. Also $\Pi$ is not held by F.

Let $p = mr$. Introducing a new unknown v, we put, in $\Pi$, $y_{j0} = v^p$. Then $\Pi$ goes over into a system $\Lambda$ in v and the $y_{ik}$ distinct from $y_{j0}$. Let F with $y_{j0}$ replaced by $v^p$ be represented by C. Then C does not hold $\Lambda$.

Let $y_{j1}, \cdots, y_{jt}$ be those unknowns in $\Lambda$ which arise from derivatives (proper) of $y_j$. We put, in $\Lambda$,

(19) $y_{ji} = v^{p-1} w_i \quad (i = 1, \cdots, t).$

Then $\Lambda$ becomes a system $\Omega$ in the $y_{ik}$ with $i \neq j$, in v and the $w_i$. Let D be the form into which C is converted by (19). Then D does not hold $\Omega$.


21. In a decomposition of $\Omega$ into essential prime systems, let $\Omega_1, \cdots, \Omega_s$ be those systems which are not held by D. We are going to show that there is some $\Omega_i$ each of whose forms vanishes when the unknowns are all replaced by 0.

Let this be false. Then there exists a form K, with no term free of the unknowns, such that $1 + K$ holds every $\Omega_i$. Let g be the degree of K considered as a polynomial in the $w_i$. We replace each $w_i$ in $1 + K$ by $y_{ji}/v^{p-1}$ and multiply the resulting expression through by $v^{g(p-1)}$. We obtain a form R given by

$R = v^{g(p-1)} + M$


where M is a form of the following description. Every term of M which involves no $y_{jk}$ is of the form

$\alpha\, v^{g(p-1)} L$


with $\alpha$ a function of x and L a power product of positive degree. Every term in M involving some $y_{jk}$ effectively is of the type

(20) $\alpha L\, v^{d}\, y_{j1}^{d_1} \cdots y_{jt}^{d_t}$

with L a power product free of the $y_{jk}$, and with


$d \geq (p - 1)(g - d_1 - \cdots - d_t).$


Thus

(21) $d + p(d_1 + \cdots + d_t) > g(p - 1).$

In other words,

(22) $R = v^{g(p-1)}(1 + G) + H$


where G, free of the $y_{jk}$, vanishes when the unknowns are all replaced by 0 and where the terms of H, all of which involve the $y_{jk}$, are of the type (20) with (21) holding. Furthermore, $CR$ holds $\Lambda$.⁸


⁸ Note that C is divisible by v.




Let $\omega$ be a primitive p-th root of unity. We replace v in R successively by $\omega^i v$, $i = 1, \cdots, p$. We obtain a set of forms $R_1, \cdots, R_p$ whose product T is a form with coefficients in $\mathfrak{F}$. T will involve v with exponents which are all multiples of p. Also, $CT$ holds $\Lambda$ and T has an expression


(23) $T = v^{gp(p-1)}(1 + P) + Q$


where P (whose exponents of v are all multiples of p) vanishes when the unknowns are replaced by 0 and where Q has terms of the type (20) with d divisible by p and with


(24) $d + p(d_1 + \cdots + d_t) > g(p - 1)p.$


In T, we replace $v^p$ by $y_j$ and consider the resulting form U as a differential polynomial. Then U is of the form


$U = y_j^{g(p-1)}(1 + V) + W$


where V vanishes for $y_i = 0$, $i = 1, \cdots, n$ and where the degree of each term of W in $y_j$ and its derivatives exceeds $g(p - 1)$.
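
The degree bound on W follows from (24) by a one-line count. The sketch below assumes that a term of type (20) has the shape $\alpha L\, v^{d} y_{j1}^{d_1} \cdots y_{jt}^{d_t}$ and that the leading part of T is $v^{gp(p-1)}(1 + P)$; both shapes are reconstructions used here for illustration rather than quotations.

```latex
% A term of Q, assumed of the type
%   \alpha L\, v^{d}\, y_{j1}^{d_1} \cdots y_{jt}^{d_t},
% with p dividing d and, by (24),  d + p(d_1 + \cdots + d_t) > g(p-1)p.
% Replacing v^p by y_j turns v^{d} into y_j^{d/p}, so the degree of the
% term in y_j and its derivatives becomes
\[
  \frac{d}{p} + d_1 + \cdots + d_t
  \;=\; \frac{d + p(d_1 + \cdots + d_t)}{p}
  \;>\; g(p-1).
\]
% Under the same replacement the leading part v^{gp(p-1)}(1+P)
% becomes y_j^{g(p-1)}(1+V).
```

With $g(p-1)$ in the role of the exponent p of § 18, U then has exactly the form (17), written for $y_j$ in place of $y_1$.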

Now $FU$ holds $\Sigma$. Then U holds $\Sigma$. According to § 18, the solution $y_i = 0$, $i = 1, \cdots, n$ of U cannot be approximated by solutions of U with $y_j \neq 0$. This contradicts the fact that U holds $\Sigma$. We have thus proved the statement made at the head of the present section.


22. Fixing our ideas, let us suppose that every form in $\Omega_1$ vanishes when the unknowns are all replaced by 0.

Considering $\Omega_1$ and D, we refer to § 68 of S.S. Corresponding to each unknown in $\Omega_1$, we find a function $\phi(x, h)$, analytic for x in some area and for h small, with $\phi(x, 0)$ equal to 0 for every x. The substitution of all of the $\phi(x, h)$ for their corresponding unknowns produces 0 for each form of $\Omega_1$ but not for D.

Passing from $\Omega_1$ to $\Lambda$, we have a $\phi(x, h)$ for every unknown in $\Lambda$, with each $\phi$ identically zero in x for $h = 0$, and with each form in $\Lambda$, but not C, reducing to zero for every x and h when the unknowns are replaced by the $\phi$. Let the expansion in powers of h of the $\phi$ for v begin with a term in $h^d$ with $d > 0$.⁹ Then the expansion of the $\phi$ for any $y_{jk}$ will, if it is not identically zero, begin with a power of h whose exponent exceeds $(p - 1)d$.

We now consider $\Pi$. For the unknowns in $\Pi$ we have a set of $\phi$ which annul the forms in $\Pi$ but not F. The lowest exponent of h in the $\phi$ for $y_{j0}$

⁹ Because D is divisible by v, the $\phi$ for v is not identically zero.




will be $pd$ while the lowest exponent for any $y_{jk}$ with $k > 0$ will exceed $(p - 1)d$.

In what follows, the $\phi$ used will be those for $\Pi$.


23. We shall use an area $\mathfrak{A}_1$ in the plane of x which satisfies the following conditions:

(a) The coefficients in F and in the $A_i$ in (18) are analytic in $\mathfrak{A}_1$.

(b) The $\phi$ are analytic for x in $\mathfrak{A}_1$ and h small.

(c) The coefficient of $h^{pd}$ in the $\phi$ for $y_{j0}$ vanishes nowhere in $\mathfrak{A}_1$.

(d) The coefficient of the lowest power of h in the result obtained by substituting the $\phi$ into F vanishes nowhere in $\mathfrak{A}_1$.


24. Let a be any value of x in $\mathfrak{A}_1$. We put $x = a$ in the $\phi$, whereupon the $\phi$ go over into functions $\psi$ of h, analytic for h small and vanishing for $h = 0$.

Every $A_i$ in (18) vanishes when x is replaced by a and the other letters by their corresponding $\psi$. A similar statement holds for the first m derivatives of the $A_i$. Let us consider $A_{q+1}^{(m+1)}$, the $(m + 1)$-st derivative of $A_{q+1}$ in (18). In $A_{q+1}^{(m+1)}$ we replace x by a and every $y_{ik}$ which occurs in $\Pi$ by its $\psi$. For every $i \leq q$, $y_{i,\,r_i+m+1}$, with $r_i$ as in § 20, may appear in $A_{q+1}^{(m+1)}$. With such letters, whether or not they figure effectively in $A_{q+1}^{(m+1)}$, we associate functions $\psi(h)$ which are identically zero and, where one of the letters figures effectively in $A_{q+1}^{(m+1)}$, we replace it in that form by 0. After these substitutions, $A_{q+1}^{(m+1)}$ becomes a linear expression in


(25) $y_{q+1,\,r_{q+1}+m+1}$


with coefficients which are functions of h. The coefficient of the letter (25) will be $S_{q+1}$, with substitutions as above. Because F is divisible by $S_{q+1}$ and because (d) of § 23 holds, that coefficient is not zero. When the expression obtained from $A_{q+1}^{(m+1)}$ is equated to zero, the letter (25) is determined as a function of h which is analytic, or has a pole, for $h = 0$. We treat $A_{q+2}^{(m+1)}$ similarly, substituting for the letter (25) the function of h just found. We proceed similarly with the $(m + 1)$-st derivatives of the other $A_i$. When this step is concluded we treat in a similar way the higher derivatives of the $A_i$.

The net result of the total operation is as follows. We obtain a set of functions $\psi$ of h, one $\psi$ for every $y_{ik}$. The $\psi$ are analytic, or have a pole, for $h = 0$; in particular the $\psi$ for $k \leq m$ are all analytic, and equal to zero, for $h = 0$. For $i \leq q$, and for $k > m$, the $\psi$ are zero. The $\psi$ for $y_{j0}$ has a least exponent of h equal to $pd$, while, for a $y_{jk}$ with $0 < k \leq m$, and with a $\psi$ not identically zero, the least exponent exceeds $(p - 1)d$. There is a $\delta > 0$ such that the $\psi$ are all analytic for $|h| < \delta$, except, perhaps, for $h = 0$.



For h small and distinct from zero, the $\psi$ become numbers which are derivatives in a normal solution of (18), the solution being analytic for $x = a$ and not annulling F at a. In such a solution, the $y_i$ with $i \leq q$ will be polynomials.

We examine $y_j$ in these solutions. There is furnished for it, by what precedes, an expansion

(26) $\psi_0(h) + \psi_1(h)(x - a) + \cdots + \psi_k(h)(x - a)^k + \cdots$


where the lowest exponent of h in $\psi_0$ is $pd$, while that in $\psi_k$ with $k \leq m$ exceeds $(p - 1)d$.

For h small, and not zero, the r-th roots of $y_j$ are analytic at a.

We write the series (26) in the form


(27) $\psi_0(h)\,[1 + \theta_1(h)(x - a) + \cdots + \theta_k(h)(x - a)^k + \cdots].$


Then, for $k \leq m$, the expansion about $h = 0$ of $\theta_k$ is either zero or has a least exponent which exceeds $-d$.

Since $p = mr$, the least exponent of h in $\psi_0$ is $mrd$. Then the r-th roots of $\psi_0$ are analytic at the origin. Let $\gamma(h)$ be such an r-th root. The least exponent of h in $\gamma$ is $md$.

One of the r-th roots of the bracket in (27) has an expansion


$1 + \beta_1(h)(x - a) + \cdots$


in which $\beta_k$ with $k \leq m$ begins with a term in h of exponent greater than $-kd$.

Thus, in an r-th root of $y_j$ as given by (26), the coefficient of $(x - a)^k$ with $k \leq m$ will begin with a positive power of h. This is enough to prove the result stated in § 19.¹⁰
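
The exponent bookkeeping of this section can be laid out in three steps. In the sketch below, $\operatorname{ord}_h$ denotes the least exponent of h in an expansion about $h = 0$; this notation, and the letters $\theta_k$, $\beta_k$, $\gamma$ for the quantities in (27) and after it, are chosen here for illustration.

```latex
% Data, for 1 <= k <= m and nonzero \psi_k:
%   ord_h \psi_0 = pd = mrd,     ord_h \psi_k > (p-1)d.
\[
  \theta_k = \psi_k / \psi_0
  \;\Longrightarrow\;
  \operatorname{ord}_h \theta_k > (p-1)d - pd = -d .
\]
% The coefficient \beta_k of (x-a)^k in an r-th root of the bracket of
% (27) is a polynomial in \theta_1, ..., \theta_k each of whose monomials
% \theta_{i_1} \cdots \theta_{i_l} has i_1 + ... + i_l = k, hence at most
% k factors, each of order greater than -d:
\[
  \operatorname{ord}_h \beta_k > -kd .
\]
% Multiplying by an r-th root \gamma of \psi_0, with ord_h \gamma = md:
\[
  \operatorname{ord}_h (\gamma\,\beta_k) > md - kd \ge 0
  \qquad (k \le m).
\]
```

The choice $p = mr$ is what makes the order $md$ of $\gamma$ large enough to absorb the worst case $k = m$.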


25. We conclude the proof of the result stated in § 15. Consider any r. Under the substitution $y_1 = v_1^r$, $\Sigma$ goes over into a system $\Lambda$ and F into a form $F_1$ in $v_1, y_2, \cdots, y_n$. From what goes before, it follows that some essential irreducible system $\Sigma_1$ in the decomposition of $\Lambda$ contains the solution $v_1 = 0$; $y_i = 0$, $i > 1$, and is not held by $F_1$. We give $\Sigma_1$, with respect to $y_2$, the treatment accorded to $\Sigma$ with respect to $y_1$. Continuing, we reach a system $\Sigma_n$ and a form $F_n$ in unknowns $v_1, \cdots, v_n$, such that $F_n$ does not hold $\Sigma_n$ and that $v_i = 0$, $i = 1, \cdots, n$, is a solution of $\Sigma_n$. This solution of $\Sigma_n$ can be approximated at the points a of a residual set by solutions which do not annul $F_n$ at a and which have expansions at a with as many small coefficients

¹⁰ For m as given, any a in $\mathfrak{A}_1$ will serve for every $\epsilon$. Thus there is a dense set of points a, in fact a residual set, which serve for all m, $\epsilon$.




us one pleases. For the particular r used, the transformation y; = v4" gives 
the solutions - -,é, of § 15. Furthermore, the residual sets which corre- 
spond to the values 1, 2, 3,---, of r have a residual set in common. The 


result of § 15 is thus completely proved. 
26. As an application, we consider a form 
G = yy”! A 


where the p; are non-negative integers whose sum is positive and where A is 
a form of the following description : 

(a) Each term of A has a degree in the yi which exceeds p; +--+ ++ pn. 

(b) Given any term LZ of A, and any yj, L is either divisible by y;” 
or else of degree higher than pj; in the yjx. 

We shall prove that the solution yi; =0,i1—1,---,n of G ts not con- 
lained in any irreducible manifold which is held by G but not by yiy2° * * Yn. 

This result appears not to be obtainable readily through the Painlevé 
transformation. 

Using a positive integer $r$ which will be fixed in a moment, we replace
each $y_i$ in $G$ by $v_i^r$. If $r$ is large enough, $A$ will go over into a form in which
each term is divisible by $v_1^{rp_1} \cdots v_n^{rp_n}$. Let $r$ be thus taken. Then $G$ goes
over into a form $H$ which is the product of $v_1^{rp_1} \cdots v_n^{rp_n}$ by a form $K$ of
the type

$1 + B$

where $B$ vanishes for $v_i = 0$, $i = 1, \dots, n$.


If our result were not true, it would follow from § 15 that $K$ has solutions
which, for suitable points $a$, have expansions which begin with as many
arbitrarily small coefficients as one may desire. This is impossible for a form
of the type $1 + B$ with $B$ vanishing for $v_i = 0$. Q. E. D.
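The choice of $r$ in § 26 can be illustrated numerically. The sketch below is not taken from the paper; it rests on the elementary observation that the $k$-th derivative of $v^r$ is divisible by $v^{r-k}$, so that a product of $d$ factors $y_{jk}$ of total derivative order $w$ becomes divisible by $v_j^{rd-w}$. Under that assumption, $r \ge w/(d - p_j)$ suffices whenever the term is not already divisible by $y_j^{p_j}$. The function name `sufficient_r` and the encoding of terms are hypothetical.

```python
def sufficient_r(terms, p):
    """Return an integer r making every term of A divisible by
    v_1^(r*p_1) ... v_n^(r*p_n) after the substitution y_j = v_j^r.

    terms: list of terms of A. Each term is a list with one entry per
           unknown y_j; entry j lists the derivative orders k of the
           factors y_{jk} in the term (order 0 is y_j itself).
    p:     list of the exponents p_j.
    """
    r = 1
    for term in terms:
        for orders, pj in zip(term, p):
            if orders.count(0) >= pj:
                continue  # term is already divisible by y_j^(p_j)
            degree, weight = len(orders), sum(orders)
            if degree <= pj:
                raise ValueError("term violates condition (b)")
            # we need r*degree - weight >= r*pj
            bound = -(-weight // (degree - pj))  # ceiling division
            r = max(r, bound)
    return r


# Illustrative form in one unknown: G = y + y_1^2, so p = [1], A = y_1^2.
r_example = sufficient_r([[[1, 1]]], [1])
```

For this illustrative $G$, $r = 2$ gives $y = v^2$, $y_1^2 = 4v^2v_1^2$, so that $G = v^2(1 + 4v_1^2)$, which has the shape $v^{rp}(1 + B)$ described above.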


V. Essential Irreducible Manifolds in the Manifold of a Form. 


27. With a view towards later applications, we shall extend here the 
results of S.S., Part I, to forms in several unknowns. 


28. Following S. S., §§ 1-3, one secures the following result.

Let $F$ and $A$ be two forms in $y_1, \dots, y_n$, both of class $n$ and algebraically
irreducible. Let the orders of $F$ and $A$ in $y_n$ be $m$ and $l < m$ respectively.
Let $A_j$ represent the $j$-th derivative of $A$ and $S$ the separant of $A$. Then
there exist a non-negative integer $t$ and a positive integer $r$ such that $S^tF$ has
a representation

(28)    $S^tF = \sum_{j=1}^{r} C_j A^{p_j} A_1^{i_{1j}} \cdots A_{m-l}^{i_{m-l,j}}$

with non-negative $p_j$ and $i_{kj}$, where no two of the $r$ sets $i_{1j}, \dots, i_{m-l,j}$ are
identical; the $C_j$ being forms which are of orders not exceeding $l$ in $y_n$ and
which are not divisible by $A$.

For any admissible $t$, (28) is unique. In what follows, the smallest $t$
will be used.


29. Let $F$ hold the general solution of $A$. Then, for the general solution
of $A$ to be an essential irreducible manifold in the manifold of $F$, it is
necessary and sufficient that (28) possess a term of the type $C_jA^{p_j}$, which term, if
(28) is considered as a polynomial in $A, A_1, \dots, A_{m-l}$, is of lower degree than
every other term of (28).


The sufficiency proof proceeds as in § 6 of S. S.

For the necessity proof, we assume that, among the terms of lowest degree
in (28), there is a term which involves derivatives of $A$. We prove that the
general solution of $A$ is not essential.

According to Part I of the present paper, the manifold of $F$ consists of
the general solutions of certain forms $B_1, \dots, B_d$. If there are $B_i$ whose
orders in $y_n$ do not exceed $l$, let $T$ denote the product of such $B_i$. Otherwise,
let $T = 1$. Let $T$ be arranged as a polynomial in the $y_{nj}$ and let $U$ be any
coefficient in $T$. Then $U$, being a form in $y_1, \dots, y_{n-1}$, does not hold the
general solution of $A$.

Considering (28) as a polynomial in $A$ and the $A_j$, we take its terms of
lowest degree and select from them those terms which have a highest degree
in $A_{m-l}$. From the terms just taken, we select those for which the degree in
$A_{m-l-1}$ is highest. We continue through $A_1$. Our process isolates a single
term of (28), with a definite $C_j$. This $C_j$ will be used in what follows.

We consider the form $UC_jS$. Let $\bar y_1, \dots, \bar y_n$ be any solution in the general
solution of $A$ which does not annul this form. We put, in (28),

(29)    $y_i = \bar y_i$, $i = 1, \dots, n-1$;  $y_n = \bar y_n + u_0$,

with $u_0$ a new unknown. By S. S., § 10, (28) goes over into a form in $u_0$
which vanishes for $u_0 = 0$ and which has, among its terms of lowest degree
in the $u_{0j}$, terms of order higher than $l$ in $u_0$.

According to §§ 11-16 of S. S., $F$ has a formal solution

(30)    $y_i = \bar y_i$, $i = 1, \dots, n-1$;  $y_n = \bar y_n + \varphi_1 c^{\rho_1} + \cdots$

where $\varphi_1$ is any solution of a differential equation of order higher than $l$.
We follow § 17 of S. S. Let $\Sigma_i$, $i = 1, \dots, d$, be the closed system whose
manifold is the general solution of $B_i$ above. Then every solution (30) is a
formal solution of some $\Sigma_i$. We say that there is a solution (30) which is a
solution of some $\Sigma_i$ whose $B_i$ has an order higher than $l$ in $y_n$. Let this be
false. Then every solution (30) annuls $T$ above. Under the substitution
(29), $T$ goes over into a form $V(u_0)$ which is annulled by every series
$\varphi_1 c^{\rho_1} + \cdots$. Because the $\bar y_i$ with $i < n$ do not annul $U$, $V$ is not identically
zero. The order of $V$ in $u_0$ does not exceed $l$. The proof is now completed as
in S. S., § 17.


VI. Equations in Two Unknowns, of the First Order. 
GENERALITIES. 


30. We deal with an algebraically irreducible form F in the unknowns 
u and v. F has an order in wu and an order in v. We shall assume that the 
maximum of these two orders is unity. 

The manifold of $F$ consists of the general solutions of forms

(31)    $F, B_1, \dots, B_e$.


The $B_i$ are determined by the methods of A. D. H., Chapter V, with the help
of Part V of the present paper.

Clearing the ground for further operations, we shall show that, if there
are $B_i$ in (31), they are of order zero in $u$ and in $v$. Let us consider $B_1$.
Because $F$ holds the general solution of $B_1$, the remainder of $F$ with respect
to $B_1$, say for the order $u$, $v$ of the unknowns, is zero. Thus, the order of $B_1$
in $v$ cannot exceed unity. If that order were unity, $F$ would be divisible by
$B_1$. Thus, the $B_i$ are simple forms.


31. The problem to which this Part VI is devoted is that of determining
the solutions which any $B_i$, call it $B$, has in common with the general solution
of $F$.

Fixing our ideas, we assume that $B$ is not free of $v$. Following § 28,
we write

$S^tF = C_0B^p + C_1B^{p_1}B'^{q_1} + \cdots + C_rB^{p_r}B'^{q_r}$,

with $S$ and $B'$ the separant and derivative respectively of $B$. The orders of
the $C$ in $v$ and in $u$ do not exceed 0 and 1 respectively and no $C$ is divisible
by $B$. For every $i$, $p < p_i + q_i$.

The sufficiency proof of the result in § 29 brings out the fact that every
solution which $B$ has in common with the general solution of $F$ is a solution
of $C_0$. Let us suppose that $B$ and $C_0$ have common solutions. Using the
order $u$, $v$ of the unknowns, let basic sets be obtained for a set of closed
irreducible systems whose manifolds make up the manifold of the system $C_0$, $B$.




We shall prove that each basic set consists of two forms. The resultant of $C_0$
and $B$ with respect to $v$ is a non-zero form in $u$ alone, of order at most unity
in $u$. Hence, given any closed irreducible system $\Sigma$ held by $C_0$ and $B$, a basic
set of $\Sigma$ starts with a form $U$ in $u$, of order at most unity. Because $B$ involves
$v$, $B$ is not divisible by $U$. Hence the remainder of $B$ with respect to $U$ is not
zero. Thus, the basic set of $\Sigma$ has a second form, of order zero in $v$, which
introduces $v$.

Let then

(32)    $U$, $V$

be a basic set for $\Sigma$, one of the closed irreducible systems considered above.
$U$ is of order at most unity in $u$ and $V$ is of order zero in $v$.

It will be proved in § 68 that, if $U$ is of order unity, the manifold of $\Sigma$
is contained in the general solution of $F$.

As to the case in which $U$ is of order zero, a theorem of E. Gourin ¹¹
shows that if $\Sigma$ has a solution in common with the general solution of $F$, the
manifold of $\Sigma$ is contained in that general solution. It thus becomes a question
of deciding whether a given solution $(u, v)$ of $\Sigma$ is contained in the
general solution. As in S. S., § 88, this case can be reduced to the case of
$u = v = 0$.

We shall therefore undertake the investigation of the following problem.
Let $F$ vanish for $u = v = 0$. It is required to determine whether $u = v = 0$
is contained in the general solution of $F$.

The case of interest, of course, is that in which one or more $B_i$ in (31)
vanish for $u = v = 0$.

Through § 67, in which a summary of results is given, we shall be occupied
with the problem just stated. Thus, through § 67, $F$ will vanish for
$u = v = 0$.

ELEMENTS. 


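32. We consider a relation of the type

(33)    $v = \varphi_1 u^{\rho_1} + \varphi_2 u^{\rho_2} + \cdots$

The $\rho$ are positive rational numbers with a common denominator, which
increase with their subscripts. The $\varphi$ are functions of $x$, all analytic in some
area.¹² It is understood that the second member of (33) may be identically
zero. If we differentiate (33) formally, we secure a relation

(34)    $v_1 = \sum_i \varphi_i' u^{\rho_i} + u_1 \frac{\partial v}{\partial u}$,

with $\varphi_i'$ the derivative of $\varphi_i$.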


11 Bulletin of the American Mathematical Society, vol. 39 (1933), p. 593. 
12 In the definition of y-solution in S. S., only a common point of analyticity was
demanded of the coefficients. The reason for using an area here will appear in § 55.




The series for $\partial v/\partial u$ may contain a finite number of negative powers of $u$.

The substitution of the above expressions for $v$ and $v_1$ into $F$ produces a
polynomial in $u_1$ whose coefficients are series in $u$ which may contain a finite
number of negative powers. If this polynomial in $u_1$ vanishes identically, we
shall call the second member of (33) an element of $F$.

For instance, if $F = v_1 + v - u_1 - u$, $F$ has $u$ as an element. Examples
of elements can be given, with constant $\varphi$, which diverge for every $u \ne 0$.


MULTIPLICITIES. 
33. Let an element of $F$ be given by (33). If

(35)    $\frac{\partial F}{\partial v}, \dots, \frac{\partial^{p-1} F}{\partial v^{p-1}}$

all vanish identically in $x$, $u$, $u_1$ when $v$ is taken as in (33), while $\partial^pF/\partial v^p$
does not, the element will be said to be of multiplicity $p$.

If $F$ has an element, $v$ or $v_1$ must figure in $F$. We shall prove that an
element of $F$ has a multiplicity if and only if $F$, considered as a polynomial
in $v$, is of positive degree.

If $F$ is of zero degree in $v$, every $\partial^iF/\partial v^i$ vanishes identically. This proves
the necessity of the condition. Let the condition be satisfied and let

$F = \alpha_0 + \alpha_1 v + \cdots + \alpha_n v^n$

with $n \ge 1$. Suppose that the $n$ forms $\partial^iF/\partial v^i$, $i = 1, \dots, n$, all vanish for
(33). We have

$\frac{\partial^n F}{\partial v^n} = n!\,\alpha_n$

so that $\alpha_n$ vanishes for (33). Again,

$\frac{\partial^{n-1} F}{\partial v^{n-1}} = (n-1)!\,\alpha_{n-1} + \frac{n!}{1!}\,\alpha_n v$

so that $\alpha_{n-1}$ must vanish for (33). Continuing, we find every $\alpha_i$ to vanish
for (33). Because $F$ is algebraically irreducible, $\alpha_0, \dots, \alpha_n$ are relatively
prime as polynomials in $u$, $u_1$, $v_1$. Hence, some linear combination of them,
with suitable forms for coefficients, is a non-zero form free of $v_1$, that is, a
non-zero form in $u$. Such a form cannot vanish for (33). This proves the
sufficiency.




STRONG SUMS.


34. We consider the effect of making, in $F$, the substitution

(36)    $v = \varphi_1 u^{\rho_1} + \cdots + \varphi_k u^{\rho_k}$

where the $k$ numbers $\rho$ are positive and rational, and increase with their
subscripts. It is understood that the second member of (36) may be identically
zero.

Under this substitution, $F$ goes over into a finite sum

(37)    $\sum_i a_i u^{\alpha_i} u_1^{\beta_i}$

with each $a_i$ a function of $x$, each $\alpha_i$ a rational number (possibly negative),
and each $\beta_i$ a non-negative integer.

If (37) is not identically zero, and if, among those of its terms for which
$\alpha_i + \beta_i$ is a minimum, there are terms with $\beta_i > 0$, we shall call the second
member of (36) a strong sum for $F$.
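The definitions of element (§ 32) and strong sum (§ 34) can be tried out on small cases. The following is a minimal sketch, not the paper's method: it assumes a one-term substitution $v = \varphi u^{\rho}$ with constant $\varphi$, so that $v_1 = \rho\varphi u^{\rho-1}u_1$, and it stores a form as a dictionary keyed by the exponents of $u$, $u_1$, $v$, $v_1$. The helper names `substitute` and `is_strong_sum` and the second test form $u + v_1$ are illustrative assumptions.

```python
from collections import defaultdict
from fractions import Fraction


def substitute(F, phi, rho):
    """Substitute v = phi * u**rho (phi constant) into F.

    F maps exponent tuples (a, b, c, d) of u^a u_1^b v^c v_1^d to
    coefficients. Returns a dict {(alpha, beta): coeff} for the sum
    of terms u^alpha u_1^beta, as in (37).
    """
    phi, rho = Fraction(phi), Fraction(rho)
    out = defaultdict(Fraction)
    for (a, b, c, d), coeff in F.items():
        # v^c -> phi^c u^(rho c);  v_1^d -> (rho phi)^d u^((rho-1) d) u_1^d
        new_coeff = Fraction(coeff) * phi**c * (rho * phi)**d
        out[(a + rho * c + (rho - 1) * d, b + d)] += new_coeff
    return {key: c for key, c in out.items() if c != 0}


def is_strong_sum(terms):
    """True if a term of least alpha + beta has beta > 0."""
    if not terms:
        return False
    least = min(a + b for a, b in terms)
    return any(b > 0 for a, b in terms if a + b == least)


# F = v_1 + v - u_1 - u, the example of section 32; v = u should annul it.
F = {(0, 0, 0, 1): 1, (0, 0, 1, 0): 1, (0, 1, 0, 0): -1, (1, 0, 0, 0): -1}
element_check = substitute(F, 1, 1)

# Illustrative form G = u + v_1 with v = u^(1/2) and with v = u^2.
G = {(1, 0, 0, 0): 1, (0, 0, 0, 1): 1}
strong_half = is_strong_sum(substitute(G, 1, Fraction(1, 2)))
strong_two = is_strong_sum(substitute(G, 1, 2))
```

An empty result from `substitute` means that the substituted expression vanishes identically, which is the defining property of an element. Each term of $F$ yields a single term of degree $\alpha + \beta + \rho(\gamma + \delta)$ in $u$, $u_1$ under this restricted substitution.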

Example 1. ... has ... as a strong sum. In fact,
$u + u^\rho$, with $\rho$ any sufficiently large rational number, will be a strong sum.


Example 2. $u + uv_1$ has no strong sum.


From any strong sum of a form $F$, new strong sums can be derived, as in
Example 1, by the addition of terms.

The part played by elements and strong sums will be as follows. Suppose
that the manifold of the form $u$ is not contained in the general solution of $F$.
It will turn out that for $u = v = 0$ to be in the general solution, it is necessary
and sufficient that $F$ have either a strong sum, or else an element which
causes no $B_i$ in (31) to vanish when substituted for $v$.


INDICES. 


35. $F$ is to be as in § 30 and is to admit $u = v = 0$ as a solution. We
denote by $\mathfrak{A}$ the region in which the coefficients in $F$ are meromorphic.

Suppose first that $F$ has no strong sum. We shall call a positive integer
$n$ the index of $F$ if there exists a set of points $\mathfrak{E}$, contained in $\mathfrak{A}$ and having
no limit point in $\mathfrak{A}$, such that, given any simply connected region $\mathfrak{A}_1$ in $\mathfrak{A}$
which contains no point of $\mathfrak{E}$, $F$ has a finite set of distinct elements which
satisfy the following conditions:

(a) The coefficients in the elements are analytic throughout $\mathfrak{A}_1$.

(b) The elements have multiplicities and the sum of the multiplicities
for the elements of the set is $n$.




(c) Every element of $F$ with coefficients analytic in some area contained
in $\mathfrak{A}_1$ coincides with some element of the set.


It is easy to see that there can be no more than one $n$ as above.

If $F$ has no strong sum and no elements, the index of $F$ will be defined
as zero.

If $F$ has a strong sum, the index of $F$ will be defined as $\infty$.

Our work will show that if $F$ has no strong sum, but has elements, a positive
$n$ exists as above. Thus an index will be known to exist for every $F$
which satisfies our assumptions. The index will play a rôle analogous to that
of the y-solution number in S. S.


POLYGONS. 
36. We write $F$ in the form

(38)    $F = \sum_{i=1}^{r} a_i u^{\alpha_i} u_1^{\beta_i} v^{\gamma_i} v_1^{\delta_i}$

with the $a_i$ functions of $x$ distinct from 0.

We put $\lambda_i = \gamma_i + \delta_i$, $\mu_i = \alpha_i + \beta_i$, and, in a plane referred to rectangular
axes, plot the points $(\lambda_i, \mu_i)$. We secure thus $r$ or fewer points, each
point associated with one or more terms of $F$.

We consider those of the plotted points which have a least abscissa, say
the abscissa $\xi_1$, and choose from them that point which has a least ordinate,
say the ordinate $\sigma_1$. For all points $(\lambda_i, \mu_i)$ with $\lambda_i > \xi_1$, if such exist, we
form the ratio

(39)    $\frac{\mu_i - \sigma_1}{\lambda_i - \xi_1}$

which is the slope of the straight segment joining $(\xi_1, \sigma_1)$ to $(\lambda_i, \mu_i)$. Let
us suppose that there are segments whose slopes (39) are negative. Taking
those segments whose slope is a minimum, we choose the longest of them.
Let its right extremity be denoted by $(\xi_2, \sigma_2)$.

It may be that there are points with $\lambda_i > \xi_2$ for which

(40)    $\frac{\mu_i - \sigma_2}{\lambda_i - \xi_2} < 0$.

If so, we take those points which minimize the first member of (40) and
choose from them that point $(\xi_3, \sigma_3)$ whose abscissa is the greatest.

We continue this construction as long as it is possible to secure segments
of negative slope. The polygon formed by the segments obtained will be called
the polygon of $F$.




If there are no points with $\lambda_i > \xi_1$, or if there are no such points for
which (39) is negative, the polygon of $F$ is defined as the point $(\xi_1, \sigma_1)$.

When we speak of the points $(\lambda_i, \mu_i)$ lying on a side of a polygon, the
extremities of the side will be included.
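The construction of § 36 is easy to mechanize. The sketch below is one possible reading of it, with the hypothetical function name `polygon` and an illustrative point set that does not come from the paper; it takes the plotted points $(\lambda_i, \mu_i)$ and returns the vertices from left to right.

```python
from fractions import Fraction


def polygon(points):
    """Vertices of the polygon of section 36, listed left to right.

    points: iterable of (lam, mu) pairs, where lam is the degree of a
            term in v, v_1 and mu its degree in u, u_1.
    """
    pts = sorted(set((Fraction(x), Fraction(y)) for x, y in points))
    x0 = pts[0][0]                       # least abscissa
    y0 = min(y for x, y in pts if x == x0)  # least ordinate at that abscissa
    vertices = [(x0, y0)]
    while True:
        cx, cy = vertices[-1]
        # segments of negative slope leaving the current vertex to the right
        cands = [((y - cy) / (x - cx), x, y)
                 for x, y in pts if x > cx and y < cy]
        if not cands:
            return vertices
        steepest = min(s for s, _, _ in cands)
        # among the segments of minimum slope, take the longest
        nx, ny = max((x, y) for s, x, y in cands if s == steepest)
        vertices.append((nx, ny))


# Illustrative point set (an assumption, not taken from the paper).
example = polygon([(0, 3), (1, 1), (1, 2), (2, 1), (3, 0)])
single = polygon([(1, 0), (2, 3)])
```

In the illustrative case the sides have slopes $-2$ and $-1/2$, and the rightmost vertex $(3, 0)$ carries the least ordinate among the plotted points. When no segment of negative slope exists, the function returns the single point $(\xi_1, \sigma_1)$.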

Consider a point $(\lambda_i, \mu_i)$, plotted for $F$, which lies on the polygon of $F$.
If there is a term associated with $(\lambda_i, \mu_i)$ which involves either $u_1$ or $v_1$, we
shall call $(\lambda_i, \mu_i)$ a b-point. If no such term exists, that is, if $(\lambda_i, \mu_i)$ is
associated with only a single term of $F$ and that term is free of $u_1$ and $v_1$,
we shall call $(\lambda_i, \mu_i)$ an a-point.


INVESTIGATION OF THE INDEX. 


37. We denote the polygon of $F$ by $P$. Let $(\xi_t, \sigma_t)$ be the point of
greatest abscissa on $P$, that is, the rightmost point on $P$. If $F$ is written as
in (38), $\sigma_t$ will be the least of the quantities $\alpha_i + \beta_i$, and $\xi_t$ will be the least
value of $\gamma_i + \delta_i$ in those terms of (38) for which $\alpha_i + \beta_i = \sigma_t$.

We are going to work toward the result that $F$ has an index and the index
of $F$ is either $\xi_t$ or $\infty$.


38. We begin by showing that if $P$ has a b-point, $F$ has strong sums.
We take first the case in which $P$ has at least one side. Let $l$ be some
side of $P$ on which a b-point lies. Let $-\rho$ be the slope of $l$. We make, in
(38), the substitution

(41)    $v = wu^{\rho}$

with $w$ an indeterminate which admits of differentiation with respect to $x$.
We have

(42)    $v_1 = w_1u^{\rho} + \rho wu^{\rho-1}u_1$.

Under (41), a term of (38) associated with a point $(\lambda_i, \mu_i)$ will yield a
set of terms, all of degree $\mu_i + \rho\lambda_i$ in $u$ and $u_1$. If $(\lambda_i, \mu_i)$ is on $l$, this degree
will be the intercept of $l$ on the axis of ordinates. Points not on $l$ produce
terms of degree greater than this intercept.
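The intercept argument can be checked on numbers. The sketch below, with hypothetical names and an illustrative point set, computes the degree $\mu_i + \rho\lambda_i$ produced by each plotted point under (41) and picks out the points attaining the least degree; for $-\rho$ equal to the slope of a side, these should be exactly the points on that side.

```python
from fractions import Fraction


def degrees_after_substitution(points, rho):
    """Degree in u, u_1 produced by each plotted point under v = w u^rho."""
    rho = Fraction(rho)
    return {(lam, mu): mu + rho * lam for lam, mu in points}


# Illustrative points; the side joining (0, 3) to (1, 1) has slope -2.
points = [(0, 3), (1, 1), (1, 2), (2, 1), (3, 0)]
deg = degrees_after_substitution(points, 2)
intercept = min(deg.values())
on_side = sorted(pt for pt, d in deg.items() if d == intercept)
```

Here the least degree is the intercept of the side on the axis of ordinates, and every point off the side yields a strictly greater degree.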

Let $A$ be the sum of those terms of $F$ which correspond to points on $l$.
Under (41), $A$ goes over into a sum

(43)    $\sum_i a_i u^{\alpha_i} u_1^{\beta_i}$

with the $a_i$ polynomials in $w$ and $w_1$. Clearly, if (43) involves $u_1$ effectively
we can fix $w$ as a function $\varphi$ of $x$ so as to make $\varphi u^{\rho}$ a strong sum for $F$.

We shall examine now the case in which (43) is free of $u_1$. In this case
(43) reduces to an expression $Bu^q$ with $q$ the intercept of $l$ and $B$ a polynomial
in $w$ and $w_1$. Now $w_1$ must appear effectively in $B$, for (43) goes over into $A$
by the substitution $w = vu^{-\rho}$.
Let $C$ be an irreducible factor of $B$ which involves $w_1$ and let

(44)    $B = CD$.

Let $\varphi$ be a function of $x$ which, substituted for $w$, annuls $C$ but not $\partial C/\partial w_1$.
In (44) we put

$w = \varphi + h$,  $w_1 = \varphi_1 + k$,

with $\varphi_1$ the derivative of $\varphi$ and $h$ and $k$ indeterminates. The members of
(44) become identical polynomials in $h$ and $k$. The terms of lowest degree
produced by $C$ are of the first degree and there is a term in $k$ with non-zero
coefficient. It follows that $B$ produces a polynomial $H$ in which the terms of
lowest degree involve $k$ effectively.

Taking any positive rational number $\delta$, we put


h = = 


Then $H$ goes over into a finite sum of type (37) in which the terms of lowest
degree involve $u_1$.
All in all, if we make in $A$, above, the substitution


(45) v = + 


with $\varphi$ as just fixed and $\delta$ rational and positive, $A$ goes over into a sum of the
type (37) in which the terms of lowest degree involve $u_1$.

Suppose now that $\delta$ is very small. Then the terms yielded by $A$ under
(45) will all have degrees close to the intercept of $l$ and will thus have lower
degrees than the other terms produced by $F$ under (45). Thus, the second
member of (45) is a strong sum for $F$.

In the case in which $P$ consists of a single point which is a b-point, we
start with $\rho$ as any positive rational number and the above argument goes
through.


39. From this point until the end of § 47, we assume that P has no 
b-point. Two preliminary questions will be treated in this section. 

We shall show that 0 is not a strong sum for $F$. Let this be false. Then
$F$ must have terms free of $v$ and $v_1$ and, in the sum of such terms, $u_1$ must
figure among the terms of lowest degree. This implies that $P$ has a b-point,
so that our statement is proved.

Suppose that $\xi_1$ of § 36 exceeds 0. Then $v = 0$ is an element of $F$. We
say that $v = 0$ which, according to § 33, has a multiplicity, has the multiplicity
$\xi_1$. It is only necessary to observe, for this, that $F$ has terms free of $v_1$
and that the minimum of the degrees in $v$ of such terms is $\xi_1$.


40. Let $P$ consist of a single point $(\xi_1, \sigma_1)$. We say first that $F$ has no
strong sum. We know that 0 is not a strong sum. Let the substitution (36)
with $\varphi_1 \ne 0$ be made in $F$. The term of $F$ in $u^{\sigma_1}v^{\xi_1}$ will produce a set of terms
of which one, a term in $u^{\sigma_1 + \rho_1\xi_1}$, will have a least degree. The familiar intercept
argument shows that this term in $u$ alone will have a lower degree than any
other term in the sum (37) which $F$ yields.

What just precedes shows also that $F$ can have no element distinct from 0.

The discussion of § 39 and of the present section proves the theorem of
§ 37 for the case in which $P$ consists of a single point, in particular, for the
case of $\xi_1 = 0$.


41. Assuming now that $P$ has sides, we undertake to determine the
possibilities for $\rho_1$ and $\varphi_1$ in a strong sum, or in an element distinct from 0.

We shall prove that $\rho_1$ is the negative of the slope of some side of $P$.
Let this be false. Let the vertices of $P$, arranged according to increasing
abscissas, be

(46)    $(\xi_1, \sigma_1), \dots, (\xi_t, \sigma_t)$.

If there are sides of $P$ of slopes greater than $-\rho_1$, let $(\xi_j, \sigma_j)$ be the first
point from the left in (46) which is the extremity of such a side. Otherwise
let $j = t$. We consider a line through $(\xi_j, \sigma_j)$ of slope $-\rho_1$. Then all points
plotted for $F$ other than $(\xi_j, \sigma_j)$ lie above this line. It follows, as in § 40,
that a substitution $v = \varphi_1u^{\rho_1} + \cdots$ in $F$ produces a non-zero term, free of
$u_1$, which is of lower degree than any other term produced. This proves our
statement.


42. Let $l$ be a side of $P$ and let $-\rho_1$ be the slope of $l$. We make in $F$
the substitutions (41), (42), with $\rho = \rho_1$. The terms of $F$ associated with
points on $l$ produce, collectively, an expression $L_1(w)u^{\tau}$ where $L_1$ is a polynomial
in $w$, and $\tau$ the intercept of $l$ on the axis of ordinates. The degree of
$L_1$ is the abscissa of the right extremity of $l$. The remaining terms of $F$
produce, collectively, an expression $M_1$ which, arranged as a sum of power
products of $u$ and $u_1$, will have all its terms of degree greater than $\tau$ in $u$ and $u_1$.

Clearly, if $\varphi_1u^{\rho_1}$ is to be the first term of an element or of a strong sum,
we must have

(47)    $L_1(\varphi_1) = 0$.




43. Let $\varphi_1$ be any solution of (47) distinct from zero.¹⁴ We make, in $F$,
the substitution

(48)    $v = \varphi_1u^{\rho_1} + v'$.

Then $F$ goes over into an expression $F'$ in $v'$ and $u$. $F'$ will be a polynomial
in $v'$ and $v'_1$, with coefficients which are sums of power products of $u$ and $u_1$.
The exponents of $u_1$ in the coefficients will be non-negative integers; those of
$u$ will be rational numbers and some of them may be negative.

We write $F'$ in the form (38). Now, however, the $\alpha_i$ may be fractional,
and even negative. We form a polygon for $F'$ in the manner explained in
§ 36. We denote this polygon by $P'$. We are going to study $P'$.

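A term in $F$ in

(49)    $u^{\alpha}u_1^{\beta}v^{\gamma}v_1^{\delta}$,

associated with the point $(\gamma + \delta, \alpha + \beta)$, contributes to $F'$ terms coming from

(50)    $u^{\alpha}u_1^{\beta}(\varphi_1u^{\rho_1} + v')^{\gamma}(\varphi_1'u^{\rho_1} + \rho_1\varphi_1u^{\rho_1-1}u_1 + v'_1)^{\delta}$

with $\varphi_1'$ the derivative of $\varphi_1$. Let us consider any term coming from (50).
If its degree in $v'$ and $v'_1$ is $\gamma + \delta - a$ with $0 \le a \le \gamma + \delta$, its degree in
$u$ and $u_1$ will be $\alpha + \beta + \rho_1a$. Such a term will be associated with the point

(51)    $(\gamma + \delta - a,\ \alpha + \beta + \rho_1a)$.

One of the points (51) will be $(\gamma + \delta, \alpha + \beta)$. The others lie on a line
sloping upward from that point, with slope $-\rho_1$.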

Now, let $h$ represent the point which is the right extremity of $l$. What
precedes makes it geometrically obvious that $h$ and all plotted points on $P$
to the right of $h$ are points plotted for $F'$ and, indeed, the lowest plotted
points of their respective abscissas. Thus, $h$ is a vertex of $P'$ and $P$ and
$P'$ coincide from $h$ onward to the right, that is, they have the same plotted
points, which are a-points for both of them.

We shall now examine $P'$ to the left of $h$. Let $\varphi_1$ be a solution of (47)
of multiplicity $p$. We shall call $p$ the multiplicity of $\varphi_1$. Under the substitution
(48), the terms of $F$ associated with points on $l$ will produce collectively
the expression

(52)    $L_1(\varphi_1 + u^{-\rho_1}v')\,u^{\tau}$.

The term of highest degree in $v'$ in (52) will be the term of $F'$ associated
with $h$. The other terms of $F'$ coming from (52) will yield points which lie

14 The number of such solutions is the length of the horizontal projection of $l$.




on $l$ or on $l$ produced to the left. The lowest power of $v'$ which figures
effectively in (52) is the $p$-th power. This means that $F'$ has a term in $u^{\tau-\rho_1p}v'^p$
which is the only term in $F'$ associated with the point $(p, \tau - \rho_1p)$ and that
every other point, plotted for $F'$, of abscissa $p$, has an ordinate greater than
$\tau - \rho_1p$. Furthermore, all points of abscissa less than $p$ which may be plotted
for $F'$ lie in the interior of the upper half-plane determined by $l$.

It follows that if $p$ is less than the abscissa of $h$, $P'$ has a side of slope
$-\rho_1$ which joins $(p, \tau - \rho_1p)$ to $h$, and that this side has only a-points.

Whether or not $p$ is less than the abscissa of $h$, if there are points plotted
for $F'$ of abscissa less than $p$, $P'$ has sides of slope less than $-\rho_1$ and
$(p, \tau - \rho_1p)$ is the rightmost extremity of the rightmost such side. Any
b-points which $P'$ may have lie on sides of slope less than $-\rho_1$.


44. If $P'$ has a b-point, lying on a side of slope $-\rho_2 < -\rho_1$, an application
of the method of § 38 shows that $F$ has a strong sum of one of the two
forms


45. We assume now that $P'$ has no b-points. Let $q$ be the abscissa of
the leftmost vertex of $P'$.

For $F'$ to be annulled by $v' = 0$, that is, for $\varphi_1u^{\rho_1}$ to be an element of $F$,
it is necessary and sufficient that $q$ exceed 0.

Suppose that $q > 0$. Then $\partial^jF'/\partial v'^j$ vanishes for $j < q$ but not for
$j = q$. This shows that $\varphi_1u^{\rho_1}$ is an element of $F$ of multiplicity $q$.

If $q = 0$, we show as in § 39 that $\varphi_1u^{\rho_1}$ is not a strong sum for $F$.


46. Suppose now that $F$ has an element with at least two terms or a
strong sum with at least two terms, the first term, in either case, being $\varphi_1u^{\rho_1}$
above. The arguments of §§ 41, 42 show that $\rho_2$ is the negative of the
slope of a side of $P'$ and that $\varphi_2$ is a root of a certain equation

(53)    $L_2(\varphi) = 0$

in which the number of solutions distinct from zero, which is the length of
the horizontal projection of a side of $P'$, does not exceed $p$ of § 43.


47. Let us assume, then, that $P'$ has sides of slope less than $-\rho_1$. Let
$-\rho_2$ be the slope of some such side. We form the corresponding equation
(53) and choose any solution $\varphi_2$ of (53) which is distinct from 0. We put
in $F'$

$v' = \varphi_2u^{\rho_2} + v''$,

whereupon $F'$ goes over into an expression $F''$ in $v''$. We form, as above, a
polygon $P''$ for $F''$. If $p_2$ is the multiplicity ¹⁵ of $\varphi_2$, $P''$ will have an a-point,
call it $h$, of abscissa $p_2$ and, if $P''$ has sides of slope less than $-\rho_2$, $h$ will be
the rightmost extremity of the rightmost such side. If there are no such sides,
$h$ is the leftmost point on $P''$. Such b-points as $P''$ may have will lie on sides
of slope less than $-\rho_2$.


48. We conclude the proof of the result stated in § 37.

If $P$ has a b-point, the index of $F$ is $\infty$.

Suppose that $P$ has no b-point. Let $r_1$ stand for $\xi_1$ in § 41. By § 39,
$F$ will have $r_1$ (possibly 0) zero elements. There will perhaps be certain
possibilities for terms $\varphi_1u^{\rho_1}$ of other elements or of strong sums. The sum of
$r_1$ and of the multiplicities of the $\varphi_1$ is $\xi_t$ of § 37.

For each $\varphi_1u^{\rho_1}$ we find an $F'$. If some $F'$ yields a b-point, the index of $F$
is $\infty$. If no b-points are met, we proceed with each $F'$ as in §§ 45, 47. We
find that $F$ has a certain number $r_2 \ge r_1$ of zero elements and elements $\varphi_1u^{\rho_1}$
and also, perhaps, a certain number of possibilities $\varphi_1u^{\rho_1} + \varphi_2u^{\rho_2}$ for the
beginnings of strong sums or of other elements. The sum of $r_2$ and of the
multiplicities of the $\varphi_2$ is $\xi_t$.

At the third step, we form an $F''$ for each $\varphi_2u^{\rho_2}$. We continue in this
manner. There are two ways in which our process, having been carried through
$k$ steps, may terminate at the $(k+1)$-th step. Firstly, we may meet an $F^{(k)}$
with a b-point. In that case, the index of $F$ is $\infty$. Secondly, it may be that
no $F^{(k)}$ has a polygon with a side of slope less than the negative of the $\rho_k$
associated with that $F^{(k)}$. In that case, $F$ will have no strong sum and will
have precisely $\xi_t$ elements of the types 0 or

$\varphi_1u^{\rho_1} + \cdots + \varphi_ju^{\rho_j}$,

in harmony with § 37.
Let us assume that the process does not terminate in a finite number of
steps. Then $F$ has no strong sum and, from some step on, no new finite
elements appear. That is, if $k$ is large, we will, in the first $k$ steps, have isolated
a fixed number $r$ of finite elements and there may be in addition a finite number
of possibilities

(54)    $\varphi_1u^{\rho_1} + \cdots + \varphi_ku^{\rho_k}$

for the beginnings of elements with an infinite number of terms. The sum of
$r$ and of the multiplicities of the $\varphi_k$, for every large $k$, is $\xi_t$. After the finite
elements have been isolated, the number of distinct expressions (54) cannot
decrease as $k$ increases. Thus, after a certain step, there will be a fixed number


15 Defined as for $\varphi_1$.




of distinct expressions (54) and, when an $F^{(k)}$ is determined for any of these
expressions, we get a $\varphi_{k+1}$ with the same multiplicity as $\varphi_k$. An $F^{(k)}$ with $k$
large does not vanish for $v^{(k)} = 0$ and its polygon has just one side of slope
less than $-\rho_k$.

This means that we are forming a certain number of infinite series which
may be elements of $F$. Let

(55)    $\varphi_1u^{\rho_1} + \varphi_2u^{\rho_2} + \cdots$

be any one of these infinite series. Let $p$ be the common multiplicity of the
$\varphi_k$ in (55) with $k$ large. We shall prove that (55) is an element of $F$ of
multiplicity $p$.

First, we shall show that the $\rho_k$ in (55) have a common denominator.
Let, for some large $k$,

(56)    $L_k(\varphi) = 0$

be the equation, similar to (53), which determines $\varphi_k$. Then the degree of
$L_k$ in $\varphi$ is $p$ and $\varphi_k$ is a root of (56) of multiplicity $p$. Thus

(57)    $L_k(\varphi) = a(\varphi - \varphi_k)^p$

with $a$ a function of $x$. This means that the side of $P^{(k-1)}$ of slope $-\rho_k$ has
on it points, plotted for $F^{(k-1)}$, of each of the abscissas $0, 1, \dots, p$. If $h_0$ is
that one of these points whose abscissa is 0 and if $h_1$ is the point of abscissa 1,
$\rho_k$ will be the difference of the ordinates of $h_0$ and $h_1$. These ordinates are
linear combinations of $\rho_1, \dots, \rho_{k-1}$ and unity, with integral coefficients.
Hence $\rho_k$ is such a linear combination. Thus we can use, for the denominator
of $\rho_k$, the common denominator of $\rho_1, \dots, \rho_{k-1}$.

We prove now that (55) is an element of $F$ of multiplicity $p$. Let $s$ be
the ordinate of the point on $P^{(k)}$ of abscissa $p$. As seen above, $s$ is independent
of $k$ for $k$ large.

For any large $k$, let polygons be formed in the usual manner for the $p + 1$
expressions $\partial^jF^{(k)}/\partial v^{(k)j}$, $j = 0, \dots, p$. The discussion of (57) shows that
the leftmost point of such a polygon will be on the axis of ordinates and will
have for ordinate

(58)    $s + (p - j)\rho_{k+1}$.

For $j = p$, (58) equals $s$ for every large $k$, but, for $j < p$, (58) becomes
infinite with $k$. For any $j$, (58) is the lowest of the degrees in $u$, $u_1$ of the
terms in the expression obtained by replacing $v^{(k)}$ in $\partial^jF^{(k)}/\partial v^{(k)j}$ by 0. The
same expression is obtained on replacing $v$ in $\partial^jF/\partial v^j$ by the sum of the first




$k$ terms in the second member of (55). This shows that (55) is an element
of $F$ of multiplicity $p$.

We have thus established the result stated in § 37.¹⁶


MULTIPLICITIES AND VANISHING DERIVATIVES. 
49. Suppose that (31) contains certain $B_i$, say

(59)    $B_1, \dots, B_d$,

which involve $v$ effectively and which vanish for $u = v = 0$. Let $B$ be any
of the $B_i$ in (59).
Referring to Part V, we write

(60)    $S^tF = C_0B^p + C_1B^{p_1}B'^{q_1} + \cdots + C_rB^{p_r}B'^{q_r}$

with $S$ and $B'$ respectively the separant and derivative of $B$. The $C$ are forms
of order zero in $v$. Furthermore,

$p < p_i + q_i$,  $i = 1, \dots, r$.

Let a given relation (33) imply $B = 0$. Then the second member of
(33) is an element of $F$. One proves as in S. S., § 55, that (33) gives an
element of $F$ of multiplicity $p$. Furthermore, (33) implies


Ov 
for lo > 0, -+- p- 
FINAL CRITERIA. 


50. We develop now a test for determining whether $F$ has either a strong
sum, or an element which annuls no $B_i$ in (59) when substituted for $v$.
Let each $B_i$ in (59) be written as a polynomial in $v$ and let $v^{q_i}$ be the
lowest power of $v$ in $B_i$ whose coefficient is not divisible by $u$. The equation
$B_i = 0$ has $q_i$ solutions of the type (33). Let $p_i$ be the value of $p$ in (60)
for $B = B_i$. Let

(61)    $m = p_1q_1 + \cdots + p_dq_d$.

We compare $m$ with $\xi_t$.

Suppose that $\xi_t > m$. Then $F$ has either a strong sum, or an element
which annuls no $B_i$ in (59).

Suppose that $\xi_t < m$. Then $\xi_t$ cannot be the index of $F$, so that $F$ has a
strong sum.


16 The matter of the areas in § 35 is handled as in S. S., § 54.




Suppose that $\xi_t = m$. We shall show, in what follows, how to determine
whether the index of $F$ is $\xi_t$ or $\infty$. If the index is $\xi_t$, $F$ has no strong sum
and every element annuls some $B_i$.
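The three-way comparison of § 50 can be summarized in a few lines. This is only a sketch of the case distinction, with a hypothetical function name and made-up numerical inputs; it does not carry out the step-by-step process needed in the case of equality.

```python
def compare_with_m(xi_t, p, q):
    """Classify F by comparing xi_t with m = p_1 q_1 + ... + p_d q_d.

    xi_t: abscissa of the rightmost point of the polygon of F.
    p, q: lists of the integers p_i and q_i attached to the B_i in (59).
    """
    m = sum(pi * qi for pi, qi in zip(p, q))
    if xi_t > m:
        return "strong sum, or element annulling no B_i"
    if xi_t < m:
        return "strong sum"
    return "undecided: index is xi_t or infinity"


# Hypothetical data: two forms B_1, B_2 with (p_i, q_i) = (1, 2) and (2, 1).
case_greater = compare_with_m(5, [1, 2], [2, 1])
case_less = compare_with_m(3, [1, 2], [2, 1])
case_equal = compare_with_m(4, [1, 2], [2, 1])
```

In the first two cases the criterion already yields a strong sum or a suitable element; only the case of equality calls for the further examination of the process of §§ 38-48.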


51. We consider the $q_1 + \cdots + q_d$ solutions of type (33) of the relations
$B_i = 0$, $i = 1, \dots, d$. Let $k$ be a positive integer such that no two of
these solutions coincide through their first $k$ terms.

We examine the process of §§ 38-48 for finding elements and strong sums
of $F$.

Let $g$ be any positive integer not greater than $k$. If the process terminates
at the $g$-th step, it must be either that we have encountered a $P^{(g-1)}$ with
b-points or that no $P^{(g-1)}$ has a side of slope less than the corresponding
$-\rho_{g-1}$. The manner of termination would indicate whether the index is $\infty$
or $\xi_t$.

Let us suppose that the process does not terminate at the $k$-th step or at
an earlier step. We shall prove that the index is $\xi_t$.

The non-termination means that we have met certain $P^{(k-1)}$ which have
no b-points and have sides of slopes less than the associated $-\rho_{k-1}$. There
will have been isolated a certain number $r_1$ of elements which are either zero
or possess at most $k - 1$ terms. The sum of $r_1$ and of the multiplicities of
the $\varphi_k$ which the $P^{(k-1)}$ yield is $\xi_t$.

Thus, $\xi_t = m$. Then the $B_i$ must have solutions (33) with at least
$k$ terms and the $\rho_k$ of the $k$-th terms must be negatives of slopes of sides of the
$P^{(k-1)}$.
(k-1) | 

Let 
(62) 


consisting of at least k terms, annul some $B_j$, say $B_i$. Let (62) have
multiplicity p for F. Then $\rho_k$ in (62) is the negative of the slope of a side of the
polygon for some $F^{(k-1)}$ which, for the substitution


yields an $F^{(k)}$ which is annulled by
(63) 


One proves now, following S. S., §§ 61, 62, that, if the polygon of $F^{(k)}$
has sides of slope less than $-\rho_k$, p is the abscissa of the right extremity of
the rightmost such side. If there are no such sides, p is the least abscissa of
the points plotted for $F^{(k)}$. One sees also, as in S. S., § 63, that (63) is zero
if and only if p is the least abscissa of the points plotted for $F^{(k)}$.




We suppose that (63) is not zero, so that the polygon $P^{(k)}$ of $F^{(k)}$ has at
least one side of slope less than $-\rho_k$. We shall prove that $P^{(k)}$ has a single
such side, namely, a side which has slope $-\rho_{k+1}$ (as in (63)) and has its left
end on the axis of ordinates. It will be seen also that this side has no b-points.

Let l be the rightmost side of $P^{(k)}$ whose slope is less than $-\rho_k$. As in
S. S., § 64, it is seen that the slope of l is not less than $-\rho_{k+1}$.

Suppose now that l has a b-point and let the rightmost such b-point be
denoted by $h_1$. The abscissa of $h_1$ is less than p.

Suppose first that one of the terms associated with $h_1$ involves $v_1^{(k)}$; let
it be a term in
(64) lig), 
with 1, >0. Let 

Guth 


By § 49, K is annulled by (63). We form a polygon for K. A point for $F^{(k)}$
yields, for K, a point $l_1 + l_2$ units to the left, or no point, according as the
point for $F^{(k)}$ does or does not have a term associated with it which is divisible
by (64). As in S. S., § 64, we see that (63) does not annul K.

Suppose now that the terms of $F^{(k)}$ associated with $h_1$ are all free of $v_1^{(k)}$.
Then $u_1$ appears in those terms. Denoting by q the abscissa of $h_1$, we let


K = 
Then a point plotted for $F^{(k)}$ yields, for K, a point q units to the left, or no
point, according as the point for $F^{(k)}$ is or is not associated with a term divisible
by $(v^{(k)})^q$. In particular, $h_1$ yields a point $h_2$ on the axis of ordinates and the
right end of l yields a point which, when joined to $h_2$, produces a leftmost
side, call it $l_1$, for the polygon of K. The slope of $l_1$ is that of l and hence is
not less than $-\rho_{k+1}$. When (63) is substituted into K, there will result, from
the point $h_2$, terms involving $u_1$ whose degree in u and $u_1$ is the ordinate of $h_2$.
The only other terms of K which can conceivably produce terms of this degree
are those associated with the points on $l_1$ other than $h_2$. Such points are
a-points and cannot yield terms which involve $u_1$. Thus, (63) does not annul
K, so that l has no b-point.

The discussion is continued as in S. S., § 64.

An argument like that in S. S., § 65, shows that the index of F is $\zeta_1$.
Our discussion has taken care of a set of $F^{(k)}$ for which the sum of the
multiplicities of the $\varphi_k$ is at least $\zeta_1 - r_k$. This shows that all $F^{(k)}$ are accounted for.




SUFFICIENT CONDITIONS. 


52. We denote the general solution of F by $\mathfrak{M}$.

Let F have an element, given by (33). We calculate the successive
derivatives of v in (33) formally, expressing each derivative as a polynomial in the
derivatives of u with series in u as coefficients.

Suppose that every form which holds $\mathfrak{M}$ vanishes identically in
$x, u, u_1, \cdots$, when $v, v_1, \cdots$ are replaced in the form by their expressions
found from (33). We shall say, in that case, that the element given by (33)
is in $\mathfrak{M}$ or belongs to $\mathfrak{M}$.

We see that if F has an element which belongs to $\mathfrak{M}$, the solution
u = v = 0 is in $\mathfrak{M}$. In short, if a form G possesses a term free of the $u_i$, $v_i$,
the substitution of v as in (33) into G cannot produce zero.

Let us prove that if there are $B_i$ as in (59), no element in $\mathfrak{M}$ can annul
any $B_i$. Let B be any such $B_i$. Referring to § 31, we denote by R the
resultant of B and $C_0$ with respect to v. Then R, which is a non-zero form
in u, vanishes for all solutions of B which are contained in $\mathfrak{M}$. If $G_1, \cdots, G_q$
is a finite set of forms whose manifold is $\mathfrak{M}$, some power of R is a linear
combination of $B, G_1, \cdots, G_q$ and their derivatives. Any element in $\mathfrak{M}$ which
annuls B would thus annul R. As R does not involve v, R cannot vanish for
such an element.

We prove now that an element of F which annuls no $B_i$ in (59) is in $\mathfrak{M}$.
Referring to (31), we let

T = B,B.: By. 


If Q is any form which holds $\mathfrak{M}$, QT holds F. It follows that every element
of F which does not annul T annuls Q. Now a $B_i$ in (31) which is annulled
by an element must involve v and must vanish for u = v = 0. Thus an element
which annuls no $B_i$ in (59) does not annul T and hence is in $\mathfrak{M}$.

Thus, for u = v = 0 to be in $\mathfrak{M}$, it is sufficient for F to have an element
which belongs to $\mathfrak{M}$. If F has elements and if there are no $B_i$ as in (59),
every element of F is in $\mathfrak{M}$. If there are $B_i$ in (59), an element of F is in $\mathfrak{M}$
if and only if it annuls no $B_i$.

53. We shall prove now that if F has a strong sum, u = v = 0 is in $\mathfrak{M}$.
Let F have a strong sum given by (36). We arrange the sum (37) as a
polynomial in $u_1$ and then equate it to zero. We secure an equation


(65)    $A_0 + A_1 u_1 + \cdots + A_q u_1^q = 0$


where the A are sums of terms of the form $b u^{\rho}$, with b a function of x, and $\rho$




rational. Regarding (65) as an algebraic equation for $u_1$, we use the Newton
polygon method to form for it solutions of the type


(66)    $u_1 = \gamma_1 u^{\sigma_1} + \gamma_2 u^{\sigma_2} + \cdots$


with the $\gamma$ and $\sigma$ as usual. Because $u_1$ figures in the terms of lowest degree
in (37), there will be at least one solution (66) (possibly identically zero)
with $\sigma_1 \ge 1$. We consider such a solution (66) and denote by p the common
denominator of its $\sigma$. Considering (66) as a differential equation, we put
$u = w^p$. Then (66) goes over into a differential equation


(67)    $w_1 = f(x, w)$


with f analytic for x in some area and w small. Furthermore, f(x, 0) is zero
for every x. Equation (67) admits a one-parameter family of solutions


$w = \varphi(x, c)$


where $\varphi$ is analytic for x in some area and for c small, and where $\varphi$, without
vanishing identically in x and c, vanishes identically in x for c = 0.
Representing the p-th power of $\varphi$ by $\xi$, we see that (37) vanishes for $u = \xi$ if c is
small and distinct from 0. For x in a suitable area, we obtain, on replacing
u by $\xi$ in (36), a one-parameter family of analytic functions v which tend
uniformly towards zero as c approaches zero. We have thus a one-parameter
family of solutions of F which approach uniformly, as c decreases, the solution
u = v = 0. For c small these solutions cannot annul any B in (31). This is
because the second member of (36) is not an element of F. Hence, for c small,
we get solutions in $\mathfrak{M}$. This proves that u = v = 0 is in $\mathfrak{M}$.
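
The Newton polygon step used above for the equation in $u_1$ can be sketched as follows. This is a generic lower-convex-hull computation under stated assumptions, not a transcription of Ritt's conventions: each coefficient of the j-th power of $u_1$ is represented only by the lowest exponent of u occurring in it, the abscissa is j, the ordinate is that exponent, and the negative of a side's slope is taken as a candidate leading exponent. The function names are hypothetical.

```python
from fractions import Fraction


def _cross(o, a, b):
    """Twice the signed area of triangle o, a, b (positive for a left turn)."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def newton_polygon_sides(lowest_exponents):
    """Sides of the lower convex hull, left to right, as (slope, width) pairs.

    lowest_exponents maps j (the power of u_1) to the lowest exponent of u
    in the coefficient of u_1**j; vanishing coefficients are simply omitted.
    """
    pts = sorted((Fraction(j), Fraction(e)) for j, e in lowest_exponents.items())
    hull = []
    for p in pts:
        # Discard points that lie on or above the segment to the new point.
        while len(hull) >= 2 and _cross(hull[-2], hull[-1], p) <= 0:
            hull.pop()
        hull.append(p)
    return [((y1 - y0) / (x1 - x0), x1 - x0)
            for (x0, y0), (x1, y1) in zip(hull, hull[1:])]


def leading_exponents(lowest_exponents):
    """Candidate leading exponents of u: the negatives of the side slopes."""
    return [-slope for slope, _ in newton_polygon_sides(lowest_exponents)]


# Illustrative equation u^3 + u*u_1 + u_1^2 = 0, encoded as {0: 3, 1: 1, 2: 0}.
sides = newton_polygon_sides({0: 3, 1: 1, 2: 0})
```

For the illustrative equation the two sides correspond to balancing $u^3$ against $u u_1$ and $u u_1$ against $u_1^2$, so both candidate exponents are at least unity, the situation singled out in the text.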


STATEMENT OF NECESSARY CONDITIONS. 


54. In the sections which follow, we shall prove that if u = v = 0 is in
$\mathfrak{M}$, at least one of the following conditions is satisfied:


(a) The manifold of the form u belongs to $\mathfrak{M}$.
(b) F has an element which belongs to $\mathfrak{M}$.
(c) F has a strong sum.


That the satisfaction of any one of these conditions insures the presence
of u = v = 0 in $\mathfrak{M}$ is already known to us.




A NORMALIZATION. 


55. It may be necessary later to interchange the letters u and v in F.
We represent F by F(u, v). When u and v are interchanged we secure a form
which we shall denote by F(v, u).

We are going to prove that if one of the three conditions of § 54 is
satisfied by F(u, v), then some one of those conditions is satisfied by F(v, u).

If (a) is satisfied, F(v, u) has the element 0, which belongs to its general
solution.

Let (b) be satisfied. If the element described is zero, the general solution
of F(v, u) contains the manifold of u = 0. If not, the inversion of (33)
produces an element of F(v, u) in the general solution of F(v, u). The fact
that the coefficients in (33) have a common area of analyticity permits the
inversion to be made.

We prove now that if F(u, v) has a strong sum, F(v, u) has a strong
sum. Let F(u, v) have a strong sum given by (36). We know from earlier
work that if the second member of (36) is zero, then $v = u^{\rho}$ with $\rho$
sufficiently large and rational is a strong sum for F(u, v). We assume in what
follows that the second member of (36) is not zero.

For v as in (36) and for $u_1$ as in (66), F(u, v) vanishes identically in
u and x. Equation (36) defines u as a power series in v,


with y, > 0. We replace u in (66) by the series in (68). We replace $u_1$ in
(66) by its expression found from (68) by differentiation. This gives a
relation between $v_1$ and v which reduces to a form
(69) V1 = + + - 


with $\delta_1 \ge 1$. Then F(u, v) vanishes identically in v and x for u as in (68)
and $v_1$ as in (69). On the other hand, because the second member of (36)
is not an element, F(u, v) does not vanish identically in x, v, $v_1$ for u as in
(68). When the substitution (68) is made in F(u, v), F(u, v) goes over
into a polynomial


(70)    $A_0 + A_1 v_1 + \cdots$


with the A series in v. Because (70) vanishes when $v_1$ is taken as in (69),
the Newton polygon for (70) must have a side of slope not greater than $-1$.
If, instead of substituting into F(u, v) the entire series in (68), we substitute
a sufficiently long segment of the series, we secure an expression similar to




(70) with the same Newton polygon which (70) has; this is because the new
expression differs from (70) by terms of high degree in v.

This means that if we take a sufficiently long segment of the series in
(68) and replace v in the segment by u, we obtain a strong sum for F(v, u).


56. Continuing with F described as in § 30, we make the assumption
that u = v = 0 belongs to $\mathfrak{M}$. Suppose that there exists a form, holding $\mathfrak{M}$,
of the type

$u^p + A$

where A either is identically zero or else has each of its terms of degree higher
than p in the $u_i$, $v_i$. By Part III, there cannot exist a form holding $\mathfrak{M}$ which
is of the type

$v^q + B$

where B either is identically zero or else has each of its terms of degree higher
than q in the $u_i$, $v_i$.


57. Let F be as in § 30 with u = v = 0 in $\mathfrak{M}$.

Suppose that F does not involve u. Then v = 0 is in the general solution
of F considered as a form in v alone. Thus, zero is an element of F in $\mathfrak{M}$ and
(b) of § 54 is satisfied. If F does not involve v, (a) is satisfied.


58. On the basis of §§ 55-57, we assume that F, described at the start
as in § 30, has u = v = 0 in its general solution $\mathfrak{M}$, that F involves both u
and v effectively$^{17}$ and that no form $u^p + A$ as in § 56 holds $\mathfrak{M}$. It will be
proved, in what follows, that one of (b) and (c) of § 54 is satisfied.


NECESSITY PROOF. 


59. We denote by S the separant of F for the order v, u of the
unknowns.$^{18}$ Let p be any positive integer. We consider the forms


(71)    $F, F_1, \cdots, F_p$


where $F_i$ is the i-th derivative of F.

We now regard the F in (71) as simple forms. The unknowns $u_i$ in the
F will be $u_0, \cdots, u_{p'}$, where p' is p or p + 1 according as the order of F in
u is 0 or 1. The unknowns $v_i$ are $v_0, \cdots, v_{p''}$ with p'' either p or p + 1.


17 The assumption that F involves v is made principally for definiteness of
procedure in § 59.
18 Note that F involves u.




The totality of simple forms which vanish for all solutions of (71) with
$S \ne 0$ is a prime system $\Lambda$. $\Lambda$ has the solution $u_i = 0$, $i = 0, \cdots, p'$;
$v_i = 0$, $i = 0, \cdots, p''$.

In $\Lambda$, we replace each $u_i$ with i > 0 by $u_0 z_i$, with $z_i$ a new unknown.
We replace $v_i$, $i = 0, \cdots, p''$, by $u_0 w_i$. Then $\Lambda$ goes over into a system $\Sigma$ in
$u_0$, the z and w. As $\Lambda$ has solutions with $u_0 \ne 0$, $\Sigma$ also has such solutions.
The totality $\Omega$ of forms in $u_0$, the z and w which vanish for all solutions of $\Sigma$
with $u_0 \ne 0$ is a prime system.

We shall prove that $\Omega$ has solutions with $u_0 = 0$. Let this be false. Let
$G_1, \cdots, G_q$ be a finite subset of forms of $\Omega$ with the same manifold as $\Omega$. Then


$u_0, G_1, \cdots, G_q$

has no solutions, so that there exists a relation

$1 = K_0 u_0 + K_1 G_1 + \cdots + K_q G_q.$


Then $1 - K_0 u_0$ is a form in $\Omega$. We prove as in S. S., § 69, that $\Lambda$ has a form
M of the type $u_0^g + A$ with each term of A of degree higher than g in the
$u_i$, $v_i$. M as a differential polynomial holds $\mathfrak{M}$. This contradicts § 58. Thus
$\Omega$ has solutions with $u_0 = 0$.


60. According to S. S., § 68, there exists a set of functions $\varphi(x, h)$, one
for each unknown in $\Omega$, analytic for x in some area and for h small, the $\varphi$ for
$u_0$ reducing to 0 for h = 0, which annul the forms in $\Omega$ but not $u_0 S$.

Passing from $\Omega$ to $\Lambda$, we have an analogous set of $\varphi(x, h)$ for the
unknowns in $\Lambda$. Let r be the lowest power of h in the expansion of the $\varphi$
which corresponds to $u_0$. Then the expansion of any other $\varphi$ which is not zero
has a least power of h no less than r.


61. Let $\Phi$ be some finite system of differential polynomials in u and v,
containing F, whose manifold is $\mathfrak{M}$.
We select a value a of x such that:


I. The coefficients in $\Phi$ are analytic at a.
II. Every $\varphi(x, h)$ for $\Lambda$, as in § 60, is analytic for x = a and for h small.
III. The coefficient of $h^r$ in the $\varphi$ corresponding to $u_0$ does not vanish at a.
IV. S does not vanish identically in h when its unknowns are replaced
by their $\varphi$ and x is put equal to a.


62. When x is replaced by a in the $\varphi$, the $\varphi$ go over into functions of h




which are analytic for h small. We denote the functions of h associated with
the $u_i$ by the generic symbol $\alpha$ and those associated with the $v_i$ by $\beta$. The $\alpha$
for $u_0$ has a zero of order r for h = 0. Every other $\alpha$ which is not identically
zero, and every $\beta$ which is not identically zero, has a zero of order at least r
for h = 0. For x = a, the F in (71) vanish identically in h when the
unknowns are replaced by their $\alpha$, $\beta$.

Let $F_i$ denote for i > p, as above for $i \le p$, the i-th derivative of F.
In $F_{p+1}$, we replace x by a, the unknowns other than $u_{p'+1}$ and $v_{p''+1}$ by their
$\alpha$ and $\beta$, and $v_{p''+1}$ by 0. Equating $F_{p+1}$ to zero after these substitutions, we
secure a linear equation for $u_{p'+1}$. Because of IV of § 61, $u_{p'+1}$ is determined
as a function of h which is either analytic, or else has a pole, for h = 0. We
treat $F_{p+2}$ similarly, making the substitutions described above and, furthermore,
replacing $v_{p''+2}$ by 0 and $u_{p'+1}$ by the function of h found above. We find, for
$u_{p'+2}$, a function of h which is either analytic, or has a pole, for h = 0.

The process just described determines a one-parameter family of solutions 


in $\mathfrak{M}$,


The $\beta_k$, and the $\alpha_k$ with $k \le p'$, have zeros of orders at least r for h = 0.
The $\alpha_k$ with k > p' may conceivably have poles for h = 0.
63. Let 


Let q be the highest of the orders of the derivatives of u and v which
appear in $\Phi$ of § 61. We assume that p of § 59 is taken greater than q.

When the second members of (72) and (74) are substituted for v and u
respectively in any form G of $\Phi$, G goes over into a series


(75)


with the $\gamma$ analytic for x = a. We prove as in S. S., § 71, that every $\gamma$ which
is not identically zero has a zero at a of order at least p − q.

Representing w in (74) by u, we find, as in S. S., § 72, that u satisfies,
for h small, a differential equation


(76) Uy = pot + pau 4+--- 




with $\mu$ which are functions of x, analytic for x = a. With this same meaning
for u, we find for v as given by (72) an expansion


with $\nu$ which are analytic for x = a.


64. Following S. S., § 73, we find from (76) and (77), by differentiation,
expansions in powers of $u^{1/r}$ for $u_i$ with $i \le q$ and $v_i$ with $i \le q$ where
q is as in § 63. These expansions contain no power of u lower than the first
power.

Let these expansions be substituted into any form G in $\Phi$. Then G


becomes an expression


(78) tou + - 


As in S. S., § 73, we prove that every coefficient in (78) which is not
identically zero has a zero at a of order at least p − q.
As in S. S., § 74, it is possible to find a value a of x which can be used


for a sequence of values of p increasing to $\infty$.


65. Referring to the necessity conditions of § 54, we assume that F,
described as in § 58, has no strong sum. We find in §§ 65, 66 an element
which belongs to $\mathfrak{M}$.

We take over the discussion of expansions in §§ 76-81 of S. S., speaking
here of u-expansions.

Choosing a sequence of values of p which increase towards $\infty$, we form,
for each p, a pair of series of the types shown in (76) and (77). Without
loss of generality, we assume that these sequences of $u_1$ and v, and also the
sequence of $v_1$ which they give by differentiation, have strong characteristics.

When v and $u_1$ are replaced in F by the corresponding series in (77) and
(76), we secure from F, according to § 64, a sequence of u-expansions
converging to zero. We are going to extract from the sequence of v a
subsequence which converges to an element of F. Later it will be shown that
this element is in $\mathfrak{M}$.

Suppose first that the v converge to zero. We shall prove that zero is an
element of F. Let this be false. Then F has terms free of v and $v_1$. Because
F has no strong sum, P has no b-points. Thus, among the terms free of v
and $v_1$, there is a term of the type $a u^q$ which is of lower degree than any of
the other terms. When v and $u_1$ are replaced in F by their u-expansions, every
term of F which involves v or $v_1$ produces a sequence of u-expansions
converging to zero. The terms free of v and $v_1$, other than $a u^q$, produce sequences
of characteristics exceeding q. This shows that zero is an element of F.

Now, suppose that the sequence of v has a finite characteristic $\rho_1$. Of
course, $\rho_1 \ge 1$. We prove, as in S. S., § 84, that P has sides and that $-\rho_1$
is the slope of a side. Continuing as in S. S., we find a subsequence of values
of p, and a first term $\varphi_1 u^{\rho_1}$ of an element of F, such that, for the p of the
subsequence, the expansions $v - \varphi_1 u^{\rho_1}$ form a sequence of characteristic
exceeding $\rho_1$.

Let F go over into an expression F' in v' and u under the substitution
(48). Then the substitution of v and $u_1$, as in (77) and (76), into F
produces the same u-expansions as the substitution of $v' = v - \varphi_1 u^{\rho_1}$ and $u_1$
into F'. Using the fact that the polygon of F' has no b-points, we prove
that, if the v' converge to zero, $\varphi_1 u^{\rho_1}$ is an element of F. If the sequence of v'
has a finite characteristic $\rho_2 > \rho_1$, we continue as in S. S., § 85. Proceeding
in this manner, we find a sequence of v as in (77) which converges to an


element of F.


66. We shall prove that the element just obtained (call it $v_0$) belongs
to $\mathfrak{M}$.

Let $v_0$ not belong to $\mathfrak{M}$, and let it annul some form (call it B) of (59).
We shall produce the contradiction that F has a strong sum.

We consider the form $C_0$ of (60) which is secured for B. The solutions
of B which belong to $\mathfrak{M}$ annul $C_0$. Let $\Sigma_1, \cdots, \Sigma_s$ be a decomposition of
the system B, $C_0$ into closed essential irreducible systems. According to § 31,
each $\Sigma_i$ has a basic set
(79)    $U_i, V_i$


introducing u and v in succession, with $U_i$ algebraically irreducible and of
order at most unity in u, and with $V_i$ of order zero in v.

Let us consider those $\Sigma_i$ of which u = v = 0 is a solution.$^{19}$ We say
that, for at least one of them, $U_i$ is of order unity. Let this be false. If a $U_i$
is of order zero and vanishes for u = 0, that $U_i$, being algebraically irreducible,
must be u multiplied by a function of x. Each $\Sigma_i$ of which u = v = 0 is not
a solution contains a form 1 + A where A vanishes for u = v = 0. We
conclude that the system B, $C_0$ is held by a form M given by


(80) M—=u(14+ 2H) 


19 Because $v_0$ annuls B, u = v = 0 annuls B.




where H vanishes for u = v = 0. Thus M holds the system obtained by
adjoining B to $\Phi$, so that some $M^g$ is a linear combination of B, the forms of
$\Phi$ and their derivatives.

We now consider the sequence of u-expansions which converges to $v_0$.
We substitute successively these expansions for v, and the associated expansions
for $u_1$, into the linear expression just found for $M^g$. Any form of $\Phi$, and any
derivative of such a form, will yield a sequence of expansions which converges
to zero. Also, for a sequence of expansions converging to $v_0$, and for the
related $u_1$, B and its derivatives yield sequences which converge to zero. Thus
$M^g$ yields a sequence converging to zero. This is impossible because a power of u
is a term of lowest degree for $M^g$.$^{20}$

Let then $\Sigma_1$ admit u = v = 0 as a solution and let $U_1$ be of order unity.
We consider the equation
(81)    $U_1 = 0$


as an algebraic equation for $u_1$. For x in a suitable area $\mathfrak{A}_1$, and for u small,
the solutions of (81) can be expressed as series of ascending fractional powers
of u with coefficients analytic in $\mathfrak{A}_1$. For x in $\mathfrak{A}_1$ and for u small but not zero,
these series give finite values for $u_1$ which, for given values of x and u, are the
only numerical solutions of (81) for $u_1$. None of these series annuls the
initial of $V_1$ identically in x and u. When any one of the series is substituted
for $u_1$ in $V_1$, the equation $V_1 = 0$ determines v as any one of a finite number
of series of ascending fractional powers of u with coefficients which will be
analytic in $\mathfrak{A}_1$ if $\mathfrak{A}_1$ is shrunk appropriately.

We obtain thus a certain number of related pairs of series for $u_1$ and v.
For x in $\mathfrak{A}_1$ and for u small but not zero, such a pair of series gives finite
values of $u_1$ and v, and the pairs of values of $u_1$ and v from all of the series
will be the only numerical solutions of $U_1 = 0$, $V_1 = 0$, for given x and u.

Because u does not hold $\Sigma_1$, the solution u = v = 0 of $\Sigma_1$ is approximable
by solutions $\tilde{u}$, $\tilde{v}$ with $\tilde{u} \ne 0$. There are points b in $\mathfrak{A}_1$ for which we can get
$\tilde{u}$ and $\tilde{v}$ which, with any finite number of their derivatives, are as small as one
pleases at b, while $\tilde{u}(b) \ne 0$. There must be some one related pair of series
for $u_1$ and v which, for some sequence of $\tilde{u}$ approximating more and more
closely to u = 0, give $\tilde{u}_1$ and $\tilde{v}$ when u is replaced by $\tilde{u}$. According to A. D. E.,
§ 89, the lowest exponent of u in the series for $u_1$ is at least unity. Because
$\tilde{u}(b)$ and $\tilde{v}(b)$ can be made arbitrarily small, the series for v must contain
only positive exponents of u. Thus there is a pair of series


20 It is unnecessary to use strong characteristics here. In S. S., § 81, the
characteristic of (159) is not less than $\alpha + \beta$, even if $\alpha$ and $\beta$ are not strong.


(83) v = Bul’ +- 


which annul $U_1$ and $V_1$. The remainder of $C_0$ with respect to the ascending
set $U_1$, $V_1$ is zero. Because the initial of $V_1$ does not vanish for (82), $C_0$ is
annulled by (82), (83). Similarly, B is annulled by (83).

We now examine (60).

Let the series in (82) and (83) be denoted by $\bar{u}_1$ and $\bar{v}$. Let $C_0$ be
expanded in powers of $u_1 - \bar{u}_1$ and $v - \bar{v}$, with u-expansions for coefficients.
The expression for $C_0$ will contain no term free of $u_1 - \bar{u}_1$ and $v - \bar{v}$.
Because $C_0$ does not vanish identically in x, u, $u_1$ for $v = \bar{v}$, the expression


contains a term free of $v - \bar{v}$. Let
(84)    $C_0 = \gamma_1 (u_1 - \bar{u}_1)^{g_1} + \cdots + \gamma_t (u_1 - \bar{u}_1)^{g_t} + \cdots$


where the $\gamma$ are u-expansions and the g increasing positive integers. The
terms which follow $\gamma_t (u_1 - \bar{u}_1)^{g_t}$ in (84) involve $v - \bar{v}$.
In (60), keeping $C_0$ in its original form as a polynomial in v, u, $u_1$, we
make the substitutions
$v = \bar{v} + w$


(85) 


with w an indeterminate and $w_1$ its derivative. Then each term in the second
member of (60) goes over into a polynomial in w and $w_1$. The coefficients
in these polynomials are polynomials in $u_1$ whose coefficients are series of
rational powers of u. $B^p$ will give a polynomial in w in which the least
exponent of w is p. The power products $B^i B_1^j$ will produce polynomials in
w and $w_1$ with terms all of degree at least p + 1.

Let the coefficient of $w^p$ in the polynomial yielded by $B^p$ be a u-expansion
in which the lowest exponent of u is h. If we replace w by $u^{\rho}$, with $\rho$ a positive
integer greater than h, we see that the replacement of v by $\bar{v} + u^{\rho}$ in $B^p$
produces a u-expansion in which the least exponent of u is $h + p\rho$.

In (84), let the sum


(86)    $\gamma_1 (u_1 - \bar{u}_1)^{g_1} + \cdots + \gamma_t (u_1 - \bar{u}_1)^{g_t}$


be written as an infinite sum $\Sigma$ of power products in $u_1$ and u. Because (86)
equals $u_1 - \bar{u}_1$ multiplied by a sum analogous to $\Sigma$, and because the terms in
$\bar{u}_1$ are all of degree at least unity, the terms of lowest degree in $\Sigma$ must involve




$u_1$. Let k be the degree of the terms of lowest degree in $\Sigma$. Let $\rho$ above exceed
k as well as h. Then the substitution $v = \bar{v} + u^{\rho}$ into $C_0 B^p$ produces a set
of terms in which the terms of lowest degree are of degree $k + h + p\rho$ and
involve $u_1$.

Now the substitution (85) with $w = u^{\rho}$, practiced on any term after the
first in the second member of (60), yields terms in u and $u_1$ of degree at least
$(p + 1)\rho$. Let $\rho > h + k$. Then the terms of lowest degree coming from
the second member of (60) will involve $u_1$.
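
The role of the last inequality can be made explicit. Writing $\rho$ for the exponent in the substitution $w = u^{\rho}$ and p for the exponent of B in the first term of the second member of (60), the lowest degree produced by the first term is $k + h + p\rho$, while every later term produces degree at least $(p+1)\rho$. The display below only restates the comparison already made in the text.

```latex
\[
  k + h + p\rho \;<\; (p+1)\rho
  \quad\Longleftrightarrow\quad
  \rho \;>\; h + k .
\]
```

Under this condition the terms of lowest degree all come from the first term, and these involve $u_1$.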

Because S in (60) is free of $u_1$, $v_1$, it must be that the substitution (85)
with $w = u^{\rho}$ reduces F to a sum of power products in u and $u_1$ in which the
terms of lowest degree involve $u_1$. If, then, v' is a sufficiently long segment
of $\bar{v}$, the replacement of v in F by $v' + u^{\rho}$ will produce a sum (37) in which
$u_1$ figures in the terms of lowest degree. Thus $v' + u^{\rho}$ is a strong sum for F.

Thus $v_0$ is an element of F in $\mathfrak{M}$. The necessity of the fulfillment of at
least one of the conditions of § 54 is established.


SUMMARY OF TEST. 


67. Let us summarize the method for testing whether u = v = 0 is
contained in $\mathfrak{M}$, the general solution of F. We recall that F is algebraically
irreducible, that the maximum of the orders of F in u and v is unity and that
u = v = 0 is a solution of F.

One first secures the decomposition (31). If there are no $B_i$ as in (31)
or no such $B_i$ which are annulled by u = v = 0, then u = v = 0 is in $\mathfrak{M}$.
In what follows, we assume that $B_i$ exist, in (31), which vanish for u = v = 0.

One tests now as in Part V to see whether the manifold of the form u is
contained in $\mathfrak{M}$. An affirmative answer means that u = v = 0 is in $\mathfrak{M}$. In
what follows, we assume that $\mathfrak{M}$ does not contain the manifold of u.

We compare $\zeta_1$ of § 37 with m of § 50, extending the definition of m so
as to have m = 0 if there are no $B_i$ as in (59). If $\zeta_1 \ne m$, u = v = 0 is in $\mathfrak{M}$.
If $\zeta_1 = m$, u = v = 0 is or is not in $\mathfrak{M}$ according as the index of F is $\infty$ or $\zeta_1$;
the index is determined as in § 51.$^{21}$
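
The final comparison of the summary can be written as a small decision routine. This is only a sketch of the last step: it assumes the two preliminary checks (the decomposition (31) and the test of Part V) have already been passed, and that the integer of § 37, the integer m of § 50 and, where needed, the outcome of § 51 are supplied by the caller. The function and parameter names are hypothetical.

```python
from typing import Optional


def origin_in_general_solution(zeta1: int, m: int,
                               index_is_infinite: Optional[bool] = None) -> bool:
    """Decide whether u = v = 0 lies in the general solution.

    zeta1             -- the integer of § 37 that is compared with m
    m                 -- the integer of § 50 (0 if there are no B_i as in (59))
    index_is_infinite -- outcome of § 51; only consulted when zeta1 == m
    """
    if zeta1 != m:
        # Unequal integers: the origin is in the general solution.
        return True
    if index_is_infinite is None:
        raise ValueError("zeta1 == m: the index must be determined as in § 51")
    # Equal integers: in the general solution exactly when the index is infinite.
    return index_is_infinite


# Values quoted in Examples 1 and 3 of § 69.
example_1 = origin_in_general_solution(4, 5)
example_3 = origin_in_general_solution(2, 2, index_is_infinite=False)
```

The routine does not compute the index; it only records how the outcome of § 51 enters the decision.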


ONE-PARAMETER FAMILIES. 


68. In § 31 it was stated that, if U in (32) is of order unity, the
manifold of $\Sigma$ is contained in $\mathfrak{M}$. We show now how this is proved.
Let $u = \varphi$, $v = \psi$ be any solution of $\Sigma$.


21 If $\zeta_1 = 0$, the index is 0. Otherwise the single point of which P is composed
would be a b-point. This would imply that $\mathfrak{M}$ contains the manifold of u.




Following § 66, we prove that U, V, B, $C_0$ are each annulled by a pair of
series
(87) = $1 + + (u— 


with $\varphi_1$ the derivative of $\varphi$. We then use (60) to show that the substitution


(88) (u—¢) 


with $\bar{v}$ the series for v in (87) and $\rho$ a sufficiently large integer, reduces F to
a sum of power products in $u - \varphi$ and $u_1 - \varphi_1$ with $u_1 - \varphi_1$ present in the
products of lowest degree. The argument of § 53 is then used to show that
the solution $u = \varphi$, $v = \psi$ of F can be approximated by solutions in $\mathfrak{M}$. Thus
$u = \varphi$, $v = \psi$ is in $\mathfrak{M}$.
EXAMPLES. 
69. Example 1. Let 


A =u(uv, + wu, — — (v—u)?. 


Let F be the form, algebraically irreducible in the field of all rational functions 
of x, defined by 


uF — v4? —T] (v—u 


j=0 


Then P has no b-point, $\zeta_1 = 4$, m = 5. $\mathfrak{M}$ contains u = v = 0.


Example 2. Let 
A = Uv, + uu, — 


Let 
(v—u+ ju’). 


j=0 


Here P has no b-points and $\zeta_1 = m = 3$. Carrying out the substitution
v = u + v', we find that P' has a b-point, so that u = v = 0 is in $\mathfrak{M}$.


Example 3. Let A =v? — u® and let A; be the derivative of A. Let 
F=A,’?—A. 


Then P has no b-point, and $\zeta_1 = m = 2$. By § 51 we see that the index is 2,
so that u = v = 0 is not in $\mathfrak{M}$.


Example 4. The similarity of the preceding examples to examples of
S. S. might lead one to ask whether the problem of the present paper cannot



be reduced to that of S. S. by replacing v by a form of the first order in u,
for instance, by $u_1$. Let $F = u + v_1^2$. Then P has a b-point so that
u = v = 0 is in $\mathfrak{M}$. If we substitute for v a form of the first order in u which
vanishes for u = 0, F becomes a form in u for which u = 0 is an essential
manifold.


Example 5. Let
$F = v_1^2 (v_1 - u_1) + v(v - u).$


The manifold of F decomposes into $\mathfrak{M}$ and the manifold of v. P has no
b-point, $\zeta_1 = 2$, m = 1. We observe that zero and u are elements. By § 41,
if F had a strong sum, the first term of the strong sum would be u. If we
put, in F,

$v = u + \varphi_2 u^{\rho_2}$


with $\varphi_2 \ne 0$, $\rho_2 > 1$, the term v(v − u) in F produces a sum of powers of u
in which the least exponent is $\rho_2 + 1$. The first term of F produces power
products in u and $u_1$ of degree no less than $\rho_2 + 2$. There is consequently no
strong sum, and the index is 2.

One might ask whether, in the case in which F has a finite index and
there are elements in $\mathfrak{M}$, the elements in $\mathfrak{M}$ have to be algebraic with respect
to u. At least in the case in which F has constant coefficients, the answer is
affirmative. The general question should be interesting to examine.


Example 6. Let $F = v_1^2 + (u_1 + v)v$. The manifold of F consists of
$\mathfrak{M}$ and the manifold of v. The two irreducible manifolds have in common the
one-parameter family u = c, v = 0.


COLUMBIA UNIVERSITY.




THE ANALYSIS OF THE DIRECT PRODUCT OF IRREDUCIBLE 
REPRESENTATIONS OF THE SYMMETRIC GROUPS.* 


By F. D. MURNAGHAN.


Let $\Gamma$ and $\Gamma'$ denote irreducible representations of the symmetric groups
on m and n letters, respectively, with characteristics $\varphi$ and $\varphi'$. Their direct
product $\Gamma \cdot \Gamma'$ is a representation, in general reducible, of the symmetric group
on m + n letters with characteristic $\varphi\varphi'$. We have given in a previous paper
(1) the analysis of $\Gamma \cdot \Gamma'$ into its irreducible components for all representations
$\Gamma$, $\Gamma'$ for which $m + n \le 9$. In the present paper we present a refinement
of the method used in (1) which makes the computations very much easier
and add the table giving the analyses of the products $\Gamma \cdot \Gamma'$ for which
m + n = 10. The simple characteristics $\varphi$ of the symmetric group have been
termed Schur-Functions (= S-Functions) and the problem under consideration
has been treated recently under the title “Multiplication of S-Functions”
by Littlewood and Richardson (2) who have proposed a scheme involving the
construction of various tableaux (based on the partitions of n with which the
irreducible representations of the symmetric group on n letters are associated).
They have, however, been unable to present a proof, in the general case, of the
theorem on which their proposed scheme rests; and, unfortunately, the example
they give to illustrate the operation of their method (namely, the direct product
of the irreducible representation $\Gamma = D(4, 3, 1)$, of dimension 70, of the
symmetric group on 8 letters by the irreducible representation $\Gamma' = D(2^2, 1)$,
of dimension 5, of the symmetric group on 5 letters) has the result incorrectly
printed. The tableaux they give furnish the correct result and so the error
in the final result must be ascribed to the printer or to an oversight in reading
off the representations associated with the various tableaux (of which there
are 34). That this error was not immediately detected will not appear
surprising when we remark that the direct product being analysed is a
representation of dimension 450,450 of the symmetric group on 13 letters. The method
we give in the present paper makes it easy to read off the analysis of this
representation in less than five minutes. We furnish rules, with illustrative
examples, which make the analysis of $\Gamma \cdot \Gamma'$ simple if $m + n \le 16$. These rules
depend for their construction on the tables, given in our paper (1) for $n \le 9$,
which furnish the analysis of the reducible representations $\Delta(\lambda)$ of the
symmetric group on n letters; and on tables, reciprocal to these, which express

* Received October 6, 1937. 


each simple characteristic of the symmetric group on $n$ letters as a linear combination of the characters of the reducible representations $\Delta(\lambda)$ of this group. These tables were originally furnished by Kostka ((3), (4), (5)) in connection with his fundamental researches on symmetric functions; his method for constructing them is, however, quite unnecessarily complicated and it is not surprising that the usefulness of his tables for the present problem has hitherto escaped attention. As his tables may not be easily accessible to those working in nuclear physics (for whom, principally, this paper is written) we give the necessary "reciprocal tables" for $n \le 9$. The calculation of these tables is very simple, involving not more than a few hours' work; in fact the simplest construction of the tables of our previous paper, which furnished the analysis of $\Delta(\lambda)$, appears to be the following. First construct the tables of the present paper; each table is a triangular matrix with diagonal elements unity and, hence, having its determinant unity. The tables furnishing the analysis of $\Delta(\lambda)$ are obtained by taking the reciprocals of these triangular matrices; a procedure involving merely a recurrent transposition of terms from one side of a system of linear equations to the other.

Since each simple characteristic $\phi$ of the symmetric group on $n$ letters is the character of a rational, homogeneous, irreducible representation, of degree $n$, of the full linear group (= group of all non-singular linear homogeneous transformations in $p$ variables) (6), the results of the present paper furnish the analysis of the Kronecker product of any two such rational, homogeneous, irreducible representations of the full linear group.

It is pleasant to conclude this introduction by remarking that our interest in the problem has been greatly stimulated by the queries of our friends Professors J. A. Wheeler and E. Wigner, whose fundamental researches in nuclear physics require the analysis of $\Gamma \cdot \Gamma'$ given here.

1. Notations and outline of the general method. Let

$(\epsilon) = (\epsilon_1, \epsilon_2, \dots, \epsilon_n)$, *

be any partition of $n$ and let $\phi_{(\epsilon)}(s)$ be the simple characteristic of the symmetric group on $n$ letters which is associated with the partition $(\epsilon)$; similarly let

$(\nu) = (\nu_1, \dots, \nu_m)$

be a partition of $m$ and let $\phi_{(\nu)}(s)$ be the corresponding simple characteristic of the symmetric group on $m$ letters. Then the product $\phi_{(\epsilon)}(s)\,\phi_{(\nu)}(s)$ is a


*We shall assume the reader familiar with the methods and notations of our 
previous paper (1) and shall refer, where convenient, to this paper by merely giving 
the page number. 




linear combination of the simple characteristics $\phi_{(\alpha)}(s)$ of the symmetric group on $n + m$ letters. We have given (p. 480) the following rule for determining the coefficients of this linear combination. Regarding $(s) = (s_1, s_2, \dots)$ as the power sums of $n + m$ variables $z_1, \dots, z_{n+m}$, $\phi_{(\epsilon)}(s)$ is a linear combination of the symmetric functions $T_{(\pi)}(z)$:

$$\phi_{(\epsilon)}(s) = \sum_{(\pi)} c_{(\pi)}\, T_{(\pi)}(z), \qquad T_{(\pi)}(z) = \sum z_1^{\pi_1} z_2^{\pi_2} \cdots z_n^{\pi_n}, \tag{1}$$

of degree $n$ in the $n + m$ variables (the summation in the expression for $\phi_{(\epsilon)}(s)$ being over all partitions $(\pi)$ of $n$). Then those characteristics $\phi_{(\alpha)}(s)$ of the symmetric group on $n + m$ letters occur for which any member of $[(\alpha) - (\pi)]$ is the partition $(\nu)$ of $m$. It is understood that $[(\alpha) - (\pi)]$ means the aggregate of partitions of $m$ obtained by subtracting $(\pi)$, in all possible arrangements, from each group of $n$ from the $n + m$ letters $(\alpha)$; and further that any disordered arrangement in the set $[(\alpha) - (\pi)]$ is restored to the normal non-increasing arrangement in the manner described in (1) (p. 461). An equivalent statement of this result is clearly the following: add $n$ zeros to $(\nu)$ so that it appears as a partition of $m$ with $n + m$ elements, $(\nu) = (\nu_1, \dots, \nu_m, 0, 0, \dots, 0)$, and add $(\pi)$ in all possible arrangements to each set of $n$ from the $n + m$ numbers $(\nu)$. The resulting partitions of $n + m$ are rearranged, if disordered, according to the rule referred to, and those which do not vanish will appear in the product $\phi_{(\epsilon)}(s)\,\phi_{(\nu)}(s)$ with the coefficient $\pm c_{(\pi)}$, the $+$ sign being used if an even, and the $-$ sign if an odd, number of inversions are necessary to bring the disordered partition of $n + m$ into its natural order.

It is clear that a slavish adherence to the rule just given would prove tedious and it would in fact be indicative of a lack of intelligence. For if the expression $\sum_{(\pi)} c_{(\pi)} T_{(\pi)}(z)$ be written as a polynomial in $z_1$, the various coefficients are symmetric functions in the $n + m - 1$ variables $z_2, \dots, z_{n+m}$, the coefficient of $z_1^p$ being of degree $n - p$. Each such symmetric function of degree $n - p$ may be expressed as a linear combination of the simple characteristics of the symmetric group on $n - p$ letters; let $a\,\phi_{(\lambda)}(s')$ be a term of this linear combination, where the $s'$ are the power sums of the $n + m - 1$ variables $z_2, \dots, z_{n+m}$. Then amongst the terms of the desired product $\phi_{(\epsilon)}(s)\,\phi_{(\nu)}(s)$ will occur terms $\phi_{(\alpha)}(s)$ where $(\alpha)$ is obtained from any term $\phi_{(\beta)}(s')$ occurring in the product $\phi_{(\nu')}(s')\,\phi_{(\lambda)}(s')$ (where $(\nu') = (\nu_2, \dots, \nu_m)$) by prefixing an element $\nu_1 + p$. We hope the reader will not be confused by the number of words necessary to state the rule, which is really simpler to apply than to describe; a few elementary illustrations will make it clear.




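The most elementary (indeed trivial) instance of our problem occurs when $n = 1$. Here there is only one partition $(\pi) = (1)$ and only one simple characteristic $\phi_{(1)}(s) = s_1 = T_{(1)}(z)$, and the original form of statement of our rule is immediately applicable. We write $(\nu)$ in the form $(\nu_1, \dots, \nu_m, 0)$ and add, in turn, unity to each element, obtaining the result (p. 480)

$$\{1\}\{\nu_1, \dots, \nu_m\} = \{\nu_1 + 1, \nu_2, \dots, \nu_m\} + \{\nu_1, \nu_2 + 1, \dots, \nu_m\} + \cdots + \{\nu_1, \dots, \nu_m + 1\} + \{\nu_1, \dots, \nu_m, 1\}$$

where we use the notation $\{\nu_1, \dots, \nu_m\}$ to denote $\phi_{(\nu)}(s)$. If $\nu_k > 0$, $\nu_{k+1} = \nu_{k+2} = \cdots = \nu_m = 0$, we omit the terms on the right in which there appears a unit preceded by a zero (p. 461) and our result appears in the simpler form

$$\{1\}\{\nu_1, \dots, \nu_k\} = \{\nu_1 + 1, \nu_2, \dots, \nu_k\} + \cdots + \{\nu_1, \dots, \nu_k + 1\} + \{\nu_1, \dots, \nu_k, 1\}.$$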

The next case, namely multiplication by $\{2\}$, is not quite so trivial and enables us to glimpse the advantages of the modification, given above, of the original form of statement. We first remark that since $p_0(z) = 1$ (p. 449), $\{0, 0, 0, \dots\} = 1$ so that $\{0, 0, 0, \dots\}\{\nu\} = \{\nu\}$. Now $\phi_{(2)}(s) = p_2(z) = T_{(2)}(z) + T_{(1^2)}(z)$ and, writing this as a polynomial (of the second degree) in $z_1$, the terms independent of $z_1$ are $\phi_{(2)}(s')$, the coefficient of $z_1$ is $\phi_{(1)}(s')$, whilst the coefficient of $z_1^2$ is unity. Hence

$$\{2\}\{\nu_1\} = \{2\}\{\nu_1, 0, 0, 0\} = \{\nu_1, 2\} + \{\nu_1 + 1, 1\} + \{\nu_1 + 2\}.$$

From this we deduce

$$\{2\}\{\nu_1, \nu_2\} = \{\nu_1, \nu_2, 2\} + \{\nu_1, \nu_2 + 1, 1\} + \{\nu_1, \nu_2 + 2\} + \{\nu_1 + 1, \nu_2, 1\} + \{\nu_1 + 1, \nu_2 + 1\} + \{\nu_1 + 2, \nu_2\}$$

and so on in general; the final result being that $\{2\} \cdot \{\nu_1, \nu_2, \dots, \nu_j\}$ is obtained by writing $\{\nu_1, \dots, \nu_j\}$ in the form $\{\nu_1, \dots, \nu_j, 0\}$ and then adding 2 and $(1, 1)$ in all possible ways (p. 480).

Multiplication by $\{1^2\}$ is even simpler since $\phi_{(1^2)}(s) = \sigma_2(z) = T_{(1^2)}(z)$. Writing this as a polynomial (of the first degree) in $z_1$, the terms independent of $z_1$ are $\phi_{(1^2)}(s')$ whilst the coefficient of $z_1$ is $\phi_{(1)}(s')$. Hence

$$\{1^2\}\{\nu_1\} = \{\nu_1, 1, 1\} + \{\nu_1 + 1, 1\}$$

and so on, in general, the final result being that $\{1^2\} \cdot \{\nu_1, \nu_2, \dots, \nu_j\}$ is obtained by writing $\{\nu_1, \dots, \nu_j\}$ in the form $\{\nu_1, \dots, \nu_j, 0, 0\}$ and then adding $(1, 1)$ in all possible ways (p. 481).
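The rule of adding 2 and (1,1) "in all possible ways" can be tried out mechanically. The following Python sketch is an illustration only: the function names `straighten`, `add_all_ways`, `times_2` and `times_11` are ours, and the restoration of a disordered symbol is implemented by adding (k-1, ..., 1, 0), sorting, and subtracting again, which we assume agrees with the rule of (1), p. 461. The partition passed in is assumed non-empty.

```python
from collections import Counter
from itertools import permutations


def straighten(seq):
    """Reduce a possibly disordered symbol {seq} to (sign, partition).

    Returns None when the symbol vanishes.
    """
    k = len(seq)
    shifted = [a + (k - 1 - i) for i, a in enumerate(seq)]
    if min(shifted) < 0 or len(set(shifted)) < k:
        return None
    inversions = sum(
        1 for i in range(k) for j in range(i + 1, k) if shifted[i] < shifted[j]
    )
    ordered = sorted(shifted, reverse=True)
    part = tuple(a - (k - 1 - i) for i, a in enumerate(ordered))
    return (-1) ** inversions, tuple(x for x in part if x > 0)


def add_all_ways(nu, pi, zeros):
    """Add pi, in every distinct arrangement, to nu followed by `zeros` zeros."""
    base = list(nu) + [0] * zeros
    padded = list(pi) + [0] * (len(base) - len(pi))
    total = Counter()
    for arrangement in set(permutations(padded)):
        reduced = straighten([b + a for b, a in zip(base, arrangement)])
        if reduced is not None:
            sign, part = reduced
            total[part] += sign
    return {p: c for p, c in total.items() if c != 0}


def times_2(nu):
    """{2}.{nu}: add 2 and (1,1) in all possible ways to (nu, 0)."""
    total = Counter(add_all_ways(nu, (2,), 1))
    total.update(add_all_ways(nu, (1, 1), 1))
    return {p: c for p, c in total.items() if c != 0}


def times_11(nu):
    """{1^2}.{nu}: add (1,1) in all possible ways to (nu, 0, 0)."""
    return add_all_ways(nu, (1, 1), 2)
```

For instance `times_2((2, 2))` produces the disordered symbol {2,4}, which is restored to -{3,3} and cancels against the {3,3} obtained by adding (1,1); only {4,2}, {3,2,1} and {2,2,2} survive.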

We hope that it is clear from the preceding paragraphs that the essential steps in the calculation of $\phi_{(\epsilon)}(s)\,\phi_{(\nu)}(s)$ are the following:




(1) First express $\phi_{(\epsilon)}(s)$ in the form $\sum c_{(\pi)} T_{(\pi)}(z)$ and write it as a polynomial in $z_1$. This step in the calculation has already been done for us (as far as $n \le 11$) in the tables of Kostka referred to in the introduction; we shall discuss it in the following paragraph.

(2) Next express the coefficients of the various powers of $z_1$ as linear combinations of the simple characteristics of the appropriate symmetric groups (the coefficient of $z_1^p$ being expressible in terms of the simple characteristics of the symmetric group on $n - p$ letters). This step has also been done for us in Kostka's tables; we give below tables furnishing the necessary coefficients (as far as $n \le 9$).


When these steps have been performed the product $\phi_{(\epsilon)}(s)\,\phi_{(\nu)}(s)$ can be at once written down if the products $\phi_{(\lambda)}(s) \cdot \phi_{(\nu')}(s)$ are known, where $(\nu') = (\nu_2, \dots, \nu_j)$ and $(\lambda)$ is a partition of $n$, or less than $n$, letters. Thus the desired products are obtainable without difficulty by a recurrence method. Before proceeding to a description of the methods by which steps (1) and (2) are carried out in general it is probably desirable to illustrate the procedure by another simple example.

Example. $\{2, 1\} \cdot \{2, 1\}$.

We shall see below that $\{2, 1\} = T_{(2,1)}(z) + 2T_{(1^3)}(z)$. The terms free of $z_1$ in this second degree polynomial are $\phi_{(2,1)}(s')$ and so the characteristics beginning with 2 in the desired product are found by prefixing a 2 to the product $\{1\} \cdot \{2, 1\}$, i.e. $\{3, 1\} + \{2^2\} + \{2, 1^2\}$. Since $\{2, 3, 1\} = 0$ (1, p. 461) they are, accordingly, $\{2^3\} + \{2^2, 1^2\}$. The coefficient of $z_1$ in our second degree polynomial is $\sum z_2^2 + 2\sum z_2 z_3 = \phi_{(2)}(s') + \phi_{(1^2)}(s')$ and so the characteristics beginning with 3 in the desired product are found by prefixing a 3 to the product $\{1\} \cdot [\{2\} + \{1^2\}]$, i.e. $\{3\} + 2\{2, 1\} + \{1^3\}$. They are, accordingly, $\{3^2\} + 2\{3, 2, 1\} + \{3, 1^3\}$. The coefficient of $z_1^2$ is $\phi_{(1)}(s')$ and so the characteristics beginning with 4 in the desired product are found by prefixing a 4 to the product $\{1\} \cdot \{1\}$, i.e. $\{2\} + \{1^2\}$. They are, accordingly, $\{4, 2\} + \{4, 1^2\}$. Hence the desired product is

$$\{4, 2\} + \{4, 1^2\} + \{3^2\} + 2\{3, 2, 1\} + \{3, 1^3\} + \{2^3\} + \{2^2, 1^2\}.$$
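The recurrence just illustrated can be sketched in Python. This is a minimal sketch under stated assumptions, not the tabular procedure of the paper: instead of reading the coefficient of each power of z_1 from tables, the hypothetical helper `strips` uses the standard fact that the coefficient of z_1^p in {eps} is the sum of the {mu} for which eps/mu is a horizontal strip of p cells; `straighten` implements the restoration of disordered symbols, assumed to agree with (1), p. 461. All function names are ours.

```python
from collections import Counter
from itertools import product as cartesian


def straighten(seq):
    """Reduce a disordered symbol {seq} to (sign, partition), or None if it vanishes."""
    k = len(seq)
    shifted = [a + (k - 1 - i) for i, a in enumerate(seq)]
    if min(shifted) < 0 or len(set(shifted)) < k:
        return None
    inversions = sum(
        1 for i in range(k) for j in range(i + 1, k) if shifted[i] < shifted[j]
    )
    ordered = sorted(shifted, reverse=True)
    part = tuple(a - (k - 1 - i) for i, a in enumerate(ordered))
    return (-1) ** inversions, tuple(x for x in part if x > 0)


def strips(eps):
    """Yield (p, mu) with eps/mu a horizontal strip of p cells."""
    bounds = list(eps[1:]) + [0]
    ranges = [range(lo, hi + 1) for hi, lo in zip(eps, bounds)]
    for mu in cartesian(*ranges):
        yield sum(eps) - sum(mu), tuple(x for x in mu if x > 0)


def multiply(eps, nu):
    """Analyse {eps}.{nu} by prefixing nu_1 + p to the smaller products."""
    if not nu:
        return {tuple(eps): 1}
    total = Counter()
    for p, mu in strips(eps):
        for beta, coeff in multiply(mu, nu[1:]).items():
            reduced = straighten((nu[0] + p,) + beta)
            if reduced is not None:
                sign, alpha = reduced
                total[alpha] += sign * coeff
    return {a: c for a, c in total.items() if c != 0}
```

Calling `multiply((2, 1), (2, 1))` reproduces the seven characteristics found above, with {3,2,1} occurring twice.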

2. Expression of the simple characteristics $\phi_{(\epsilon)}(s)$ of the symmetric group on $n$ letters in terms of the symmetric functions $T_{(\pi)}(z)$. This often treated problem is so simple that it can be discussed ab initio in a few lines. $\phi_{(\epsilon)}(s)$ may be presented as a determinant of order $n$ whose diagonal elements are $p_{\epsilon_1}(z), p_{\epsilon_2}(z), \dots, p_{\epsilon_n}(z)$, the non-diagonal elements being obtained by methodically increasing (decreasing) by unity the labels attached to the $p(z)$ as we move from each column to its neighbor on the right (left). An equivalent statement of this fact is the following: let $\xi_j$ be an operator which decreases by unity the value of the label carrying the subscript $j$ $(j = 1, 2, \dots, n)$ and let $\bar\epsilon_1 = \epsilon_1 + (n - 1)$, $\bar\epsilon_2 = \epsilon_2 + (n - 2)$, $\dots$, $\bar\epsilon_n = \epsilon_n$ (so that $\bar\epsilon_1 > \bar\epsilon_2 > \cdots > \bar\epsilon_n \ge 0$). Then

$$\phi_{(\epsilon)}(s) = \begin{vmatrix} \xi_1^{n-1} & \xi_1^{n-2} & \cdots & 1 \\ \vdots & \vdots & & \vdots \\ \xi_n^{n-1} & \xi_n^{n-2} & \cdots & 1 \end{vmatrix}\; p_{\bar\epsilon_1}\, p_{\bar\epsilon_2} \cdots p_{\bar\epsilon_n}.$$

If, therefore, we denote by $K_{(\lambda)}(z)$ the characteristic $p_{\lambda_1} \cdots p_{\lambda_n}$ of the reducible representation $\Delta(\lambda)$ of the symmetric group on $n$ letters we see that $\phi_{(\epsilon)}(s)$ is a linear combination of the various compound characteristics $K_{(\lambda)}(z)$; only those characteristics $K_{(\lambda)}(z)$ occurring for which $(\lambda)$ increased by the set $(n - 1, n - 2, \dots, 1, 0)$, arranged in any order, gives the set $(\bar\epsilon) = (\bar\epsilon_1, \dots, \bar\epsilon_n)$ arranged in any order. The sign attached to $K_{(\lambda)}(z)$ is plus (minus) if the arrangement of the set $(n - 1, n - 2, \dots, 1, 0)$ which is added to $(\lambda)$ to give $(\bar\epsilon)$ is even (odd). E.g., let $(\epsilon) = (3, 2, 1)$ so that $(\bar\epsilon) = (5, 3, 1)$; to find the $K_{(\lambda)}(z)$ which occur in the expression for $\phi_{(3,2,1)}(s)$ we subtract $(2, 1, 0)$ in all possible orders from $(5, 3, 1)$, obtaining the six terms $(3, 2, 1)$, $-(3, 3, 0)$, $(5, 1, 0)$, $-(5, 2, -1)$, $(4, 3, -1)$, $-(4, 1, 1)$. Of these the fourth and fifth vanish since they contain a negative element (each $p_j(z)$ for which $j < 0$ being zero) and so $\phi_{(3,2,1)} = K_{(3,2,1)} - K_{(3^2)} - K_{(4,1^2)} + K_{(5,1)}$. The same rule holds if we wish to find the expression for any symmetric function $T_{(\lambda)}(z) = \sum z_1^{\lambda_1} \cdots z_n^{\lambda_n}$ of degree $n$ in terms of the simple characteristics $\phi_{(\epsilon)}(s)$. In fact $\phi_{(\epsilon)}(s)$ may be presented as the ratio

$$\phi_{(\epsilon)}(s) = \frac{\Delta(\bar\epsilon_1, \dots, \bar\epsilon_n)}{\Delta(n - 1, \dots, 1, 0)}$$

where $\Delta(v_1, \dots, v_n)$ is the $n$-rowed determinant of which the elements in the $j$-th row are the $v_j$-th powers of the indeterminates $(z_1, \dots, z_n)$ $(j = 1, \dots, n)$ (p. 459). On multiplying $T_{(\lambda)}(z)$ by the Vandermonde determinant $\Delta(z) = \Delta(n - 1, \dots, 1, 0)$ we obtain a collection of determinants $\Delta(v_1, \dots, v_n)$ and on dividing through by $\Delta(n - 1, \dots, 1, 0)$ we see that $T_{(\lambda)}(z)$ is expressible as a linear combination of the simple characteristics $\phi_{(\epsilon)}(s)$; only those simple characteristics $\phi_{(\epsilon)}(s)$ occurring for which $(\epsilon)$ increased by the set $(n - 1, \dots, 1, 0)$, i.e. $(\bar\epsilon)$, is the same as one of the sets $(\lambda)$ + any arrangement of the set $(n - 1, \dots, 1, 0)$. The sign attached to $\phi_{(\epsilon)}(s)$ is plus (minus) if the arrangement of the set $(n - 1, \dots, 1, 0)$ which is added to $(\lambda)$ to give $(\bar\epsilon)$ is even (odd). Hence the coefficient of $K_{(\lambda)}(z)$ in the development of $\phi_{(\epsilon)}(s)$ is the same as the coefficient of $\phi_{(\epsilon)}(s)$ in the development of $T_{(\lambda)}(z)$ (a result apparently first observed by Kostka). The second step in our general procedure is therefore easy provided we know




the expansions of the simple characteristics $\phi_{(\epsilon)}(s)$ in terms of the compound characteristics $K_{(\lambda)}(z)$. For partitions $(\epsilon)$ containing not more than three non-zero elements these expansions are trivial since they involve nothing more than the expansion of a determinant of order three or less. If $(\epsilon)$ has only one non-zero element, $\phi_{(\epsilon)}(s)$ appears as a determinant of one row and $\phi_{(\epsilon)}(s) = K_{(\epsilon)}(z)$; e.g., $\phi_{(4)}(s) = K_{(4)}(z)$. For partitions with two non-zero elements we have to expand a two-row determinant; e.g.,

$$\phi_{(4,2)}(s) = \begin{vmatrix} p_4(z) & p_5(z) \\ p_1(z) & p_2(z) \end{vmatrix} = p_4(z)\,p_2(z) - p_5(z)\,p_1(z)$$

so that $\phi_{(4,2)}(s) = K_{(4,2)}(z) - K_{(5,1)}(z)$. For three non-zero element partitions we proceed similarly; e.g.,

$$\phi_{(3,2^2)}(s) = \begin{vmatrix} p_3 & p_4 & p_5 \\ p_1 & p_2 & p_3 \\ p_0 & p_1 & p_2 \end{vmatrix} = p_3 p_2^2 - p_3^2 p_1 - p_4 p_2 p_1 + p_5 p_1^2 + p_4 p_3 - p_5 p_2$$

and so

$$\phi_{(3,2^2)}(s) = K_{(3,2^2)}(z) - K_{(3^2,1)}(z) - K_{(4,2,1)}(z) + K_{(4,3)}(z) + K_{(5,1^2)}(z) - K_{(5,2)}(z).$$

For partitions containing four or more elements it is convenient to expand $\phi_{(\epsilon)}(s)$ in terms of the first column, the cofactors of the elements in this column being simple characteristics corresponding to three element partitions. E.g., suppose we wish $\phi_{(5,2,1^2)}(s)$: from the expression

$$\phi_{(5,2,1^2)}(s) = \begin{vmatrix} p_5 & p_6 & p_7 & p_8 \\ p_1 & p_2 & p_3 & p_4 \\ 0 & p_0 & p_1 & p_2 \\ 0 & 0 & p_0 & p_1 \end{vmatrix}$$

we read $\phi_{(5,2,1^2)}(s) = p_5\,\phi_{(2,1^2)}(s) - p_1\,\phi_{(6,1^2)}(s)$ and from the supposed known analyses of $\phi_{(2,1^2)}(s)$ and $\phi_{(6,1^2)}(s)$ we find (since $p_5 K_{(2,1^2)}(z) = K_{(5,2,1^2)}(z)$ and so on)

$$\phi_{(5,2,1^2)}(s) = -K_{(8,1)}(z) + K_{(7,1^2)}(z) + K_{(6,2,1)}(z) - K_{(6,1^3)}(z) + K_{(5,4)}(z) - K_{(5,3,1)}(z) - K_{(5,2^2)}(z) + K_{(5,2,1^2)}(z).$$

We give below tables furnishing the expressions for the simple characteristics $\phi_{(\epsilon)}(s)$ in terms of the compound characteristics $K_{(\lambda)}(z)$ for all values of $n$ up to 9 inclusive. The simple characteristics are written down the left-hand side of each table (the partitions $(\epsilon)$ of $n$ being arranged in dictionary order) and the compound characteristics are written across the top of each table



(the partitions $(\lambda)$ of $n$ being also arranged in dictionary order). The matrix connecting the $\phi_{(\epsilon)}(s)$ with the $K_{(\lambda)}(z)$ is triangular with diagonal elements all unity and, hence, is of determinant unity; the triangular reciprocal matrix expresses the compound characteristics $K_{(\lambda)}(z)$ in terms of the simple characteristics $\phi_{(\epsilon)}(s)$. These reciprocal matrices have been given, for $2 \le n \le 9$, in our paper (1) (pp. 475-477). From Kostka's theorem stating that the coefficient of $K_{(\lambda)}(z)$ in the development of $\phi_{(\epsilon)}(s)$ is the same as the coefficient of $\phi_{(\epsilon)}(s)$ in the development of $T_{(\lambda)}(z)$ we see that the coefficients of this latter development are found in the column headed by the partition $(\lambda)$ in the tables which furnish the expressions for the $\phi_{(\epsilon)}(s)$ in terms of the $K_{(\lambda)}(z)$. As examples of how these tables are read we cite the following: *

$\{1^2\} = -K(2) + K(1^2)$;  $T(2) = \{2\} - \{1^2\}$

$\{2, 1\} = -K(3) + K(2, 1)$;  $T(2, 1) = \{2, 1\} - 2\{1^3\}$

$\{2, 1^2\} = K(4) - K(3, 1) - K(2^2) + K(2, 1^2)$;  $T(2^2) = \{2^2\} - \{2, 1^2\} + \{1^4\}$

$\{2^3\} = -K(4, 2) + K(4, 1^2) + K(3^2) - 2K(3, 2, 1) + K(2^3)$;

$T(3, 2, 1) = \{3, 2, 1\} - 2\{3, 1^3\} - 2\{2^3\} + 4\{2, 1^4\} - 6\{1^6\}$.
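The entries of these tables can be regenerated by expanding the determinant directly. The Python sketch below is illustrative only (the names `partitions`, `schur_in_K` and `T_in_schur` are ours): `schur_in_K` sums over the arrangements of (k-1, ..., 1, 0) as described in § 2, and `T_in_schur` reads a column of the resulting table, relying on Kostka's theorem as stated above.

```python
from collections import Counter
from itertools import permutations


def partitions(n, largest=None):
    """Yield the partitions of n in dictionary order, largest first."""
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest


def schur_in_K(eps):
    """Coefficients of K(lambda) in {eps}, by expanding the determinant."""
    k = len(eps)
    total = Counter()
    for perm in permutations(range(k)):
        # Row i, column perm[i] of the determinant carries the label eps_i - i + perm[i].
        labels = [eps[i] - i + perm[i] for i in range(k)]
        if min(labels) < 0:
            continue  # p_j vanishes for j < 0
        inversions = sum(
            1 for i in range(k) for j in range(i + 1, k) if perm[i] > perm[j]
        )
        lam = tuple(sorted((x for x in labels if x > 0), reverse=True))
        total[lam] += (-1) ** inversions
    return {lam: c for lam, c in total.items() if c != 0}


def T_in_schur(lam):
    """Coefficients of {eps} in T(lambda): the column headed (lambda)."""
    column = {}
    for eps in partitions(sum(lam)):
        c = schur_in_K(eps).get(tuple(lam), 0)
        if c:
            column[eps] = c
    return column
```

The full sum over permutations grows factorially with the number of parts, which is harmless for n up to 9 but is the reason the text recommends expanding by the first column for longer partitions.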


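If we denote by $c^{(\lambda)}_{(\epsilon)}$ the coefficient of $K_{(\lambda)}(z)$ in the development of $\phi_{(\epsilon)}(s)$:

$$\phi_{(\epsilon)}(s) = \sum_{(\lambda)} c^{(\lambda)}_{(\epsilon)}\, K_{(\lambda)}(z),$$

Kostka's theorem finds its expression in the formula

$$T_{(\lambda)}(z) = \sum_{(\epsilon)} c^{(\lambda)}_{(\epsilon)}\, \phi_{(\epsilon)}(s).$$

Expressed in technical terms this says the matrix of the linear transformation from the $\phi_{(\epsilon)}(s)$ to the $T_{(\lambda)}(z)$ is the transpose of the matrix of the linear transformation from the $K_{(\lambda)}(z)$ to the $\phi_{(\epsilon)}(s)$. Since the reciprocal of the transpose of a matrix is the transpose of the reciprocal of the matrix it follows that if we write

$$K_{(\lambda)}(z) = \sum_{(\epsilon)} d^{(\epsilon)}_{(\lambda)}\, \phi_{(\epsilon)}(s)$$

then

$$\phi_{(\epsilon)}(s) = \sum_{(\lambda)} d^{(\epsilon)}_{(\lambda)}\, T_{(\lambda)}(z).$$

In other words the coefficients of the expressions for the simple characteristics $\phi_{(\epsilon)}(s)$ in terms of the symmetric functions $T_{(\lambda)}(z)$ are found in the column headed by the partition $(\lambda)$ in the tables on pp. 475-477 of our paper (1). As examples of how these tables are read we cite the following: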

* For convenience we denote $K_{(\lambda)}(z)$ by $K(\lambda)$ and $T_{(\lambda)}(z)$ by $T(\lambda)$.




$\{2, 1\} = T(2, 1) + 2T(1^3)$;

$\{2^2\} = T(2^2) + T(2, 1^2) + 2T(1^4)$;

$\{3, 1^2\} = T(3, 1^2) + T(2^2, 1) + 3T(2, 1^3) + 6T(1^5)$;

$\{3^2\} = T(3^2) + T(3, 2, 1) + T(3, 1^3) + T(2^3) + 2T(2^2, 1^2) + 3T(2, 1^4) + 5T(1^6)$.

We have, accordingly, in the tables of this and the preceding paper the information necessary to carry out (as far as $n \le 9$, $m \le 9$) the steps (1) and (2) of our general method.
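The coefficients in the four examples just cited can be checked independently of the tables. The sketch below is an assumption-laden illustration rather than the method of the paper: it uses the standard combinatorial fact that the coefficient of T(content) in {shape} is the number of ways of stripping the diagram of `shape` by successive horizontal strips whose sizes are the parts of `content`. The names `strips` and `kostka` are ours.

```python
from functools import lru_cache
from itertools import product as cartesian


def strips(shape, size):
    """Yield mu with shape/mu a horizontal strip of exactly `size` cells."""
    bounds = list(shape[1:]) + [0]
    ranges = [range(lo, hi + 1) for hi, lo in zip(shape, bounds)]
    for mu in cartesian(*ranges):
        if sum(shape) - sum(mu) == size:
            yield tuple(x for x in mu if x > 0)


@lru_cache(maxsize=None)
def kostka(shape, content):
    """Coefficient of T(content) in {shape}; both arguments are tuples."""
    if not content:
        return 1 if not shape else 0
    return sum(kostka(mu, content[:-1]) for mu in strips(shape, content[-1]))
```

For example `kostka((3, 3), (2, 2, 1, 1))` gives the coefficient 2 of $T(2^2, 1^2)$ in $\{3^2\}$.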

3. Tables furnishing the expressions for the simple characteristics $\phi_{(\epsilon)}(s)$ in terms of the compound characteristics $K_{(\lambda)}(z)$ for values of $n$ from 2 to 9 inclusive. In the following tables the $\phi_{(\epsilon)}(s)$ are denoted by $\{\epsilon\}$ and appear down the left-hand side of the table whilst the $K_{(\lambda)}(z)$ are denoted by $(\lambda)$ and appear across the top of the table. For convenience of printing, Table 8, $n = 9$, is turned around so that the bottom of the page is the left-hand side of the table and the left-hand side of the page the top of the table. The numbers to the right of the main diagonal of each table are all zero and are not written in.


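1. n = 2.

        | (2) | (1^2)
{2}     |  1  |
{1^2}   | -1  |  1

2. n = 3.

        | (3) | (2,1) | (1^3)
{3}     |  1  |
{2,1}   | -1  |   1   |
{1^3}   |  1  |  -2   |   1

3. n = 4.

        | (4) | (3,1) | (2^2) | (2,1^2) | (1^4)
{4}     |  1  |
{3,1}   | -1  |   1   |
{2^2}   |  0  |  -1   |   1   |
{2,1^2} |  1  |  -1   |  -1   |    1    |
{1^4}   | -1  |   2   |   1   |   -3    |   1

4. n = 5.

        | (5) | (4,1) | (3,2) | (3,1^2) | (2^2,1) | (2,1^3) | (1^5)
{5}     |  1  |
{4,1}   | -1  |   1   |
{3,2}   |  0  |  -1   |   1   |
{3,1^2} |  1  |  -1   |  -1   |    1    |
{2^2,1} |  0  |   1   |  -1   |   -1    |    1    |
{2,1^3} | -1  |   1   |   2   |   -1    |   -2    |    1    |
{1^5}   |  1  |  -2   |  -2   |    3    |    3    |   -4    |   1

5. n = 6.

          | (6) | (5,1) | (4,2) | (4,1^2) | (3^2) | (3,2,1) | (3,1^3) | (2^3) | (2^2,1^2) | (2,1^4) | (1^6)
{6}       |  1  |
{5,1}     | -1  |   1   |
{4,2}     |  0  |  -1   |   1   |
{4,1^2}   |  1  |  -1   |  -1   |    1    |
{3^2}     |  0  |   0   |  -1   |    0    |   1   |
{3,2,1}   |  0  |   1   |   0   |   -1    |  -1   |    1    |
{3,1^3}   | -1  |   1   |   1   |   -1    |   1   |   -2    |    1    |
{2^3}     |  0  |   0   |  -1   |    1    |   1   |   -2    |    0    |   1   |
{2^2,1^2} |  0  |  -1   |   1   |    1    |   0   |    0    |   -1    |  -1   |     1     |
{2,1^4}   |  1  |  -1   |  -2   |    1    |  -1   |    4    |   -1    |   1   |    -3     |    1    |
{1^6}     | -1  |   2   |   2   |   -3    |   1   |   -6    |    4    |  -1   |     6     |   -5    |   1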


[Tables 6 (n = 7), 7 (n = 8) and 8 (n = 9): the entries are not legible in this copy.]


4. Rules for the analysis of the direct product with illustrative examples. If $(\epsilon')$ is the associate partition of $n$ to $(\epsilon)$, $\phi_{(\epsilon')}(s)$ is obtained from $\phi_{(\epsilon)}(s)$ by changing the signs of $s_2, s_4, \dots$ (p. 453). Hence the analysis of $\phi_{(\epsilon')}(s)\,\phi_{(\nu')}(s)$ follows from that of $\phi_{(\epsilon)}(s)\,\phi_{(\nu)}(s)$ by taking the associates of all the partitions found in the latter. Thus from

$$\{2, 1\} \cdot \{1^2\} = \{3, 2\} + \{3, 1^2\} + \{2^2, 1\} + \{2, 1^3\}$$

we read

$$\{2, 1\} \cdot \{2\} = \{4, 1\} + \{3, 2\} + \{3, 1^2\} + \{2^2, 1\}.$$
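Taking associates is easily mechanised. In the Python sketch below (function names are ours, chosen for illustration) a partition is a tuple and an analysis is a dictionary from partitions to coefficients; the associate of a partition is the tuple of column lengths of its diagram.

```python
def associate(partition):
    """Associate (conjugate) partition: the column lengths of the diagram."""
    if not partition:
        return ()
    return tuple(
        sum(1 for part in partition if part > column)
        for column in range(partition[0])
    )


def associate_analysis(analysis):
    """Take the associate of every partition in an analysis {partition: coefficient}."""
    return {associate(p): c for p, c in analysis.items()}


# The analysis of {2,1}.{1^2} quoted in the text ...
product_with_11 = {(3, 2): 1, (3, 1, 1): 1, (2, 2, 1): 1, (2, 1, 1, 1): 1}
# ... yields that of {2,1}.{2}, since (2,1) is its own associate.
product_with_2 = associate_analysis(product_with_11)
```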


In calculating $\{\epsilon\} \cdot \{\nu\}$ it is convenient to take $n \le m$ and to have $(\epsilon)$ preceded by (if not identical with) its associate $(\epsilon')$; otherwise we calculate first $\{\epsilon'\} \cdot \{\nu'\}$ and read off from this the desired result. The obvious reason for this is that the tables of the preceding paragraph and of (1) pp. 475-477 are simpler as we proceed farther out in our ordered set of partitions. As we have seen in § 2 the general procedure in analysing $\{\epsilon\} \cdot \{\nu\}$ may be formalised as follows:

A. Precede by $\nu_1$ each partition of the, supposed known, analysis of the product of $\{\nu_2, \dots, \nu_m\}$ by $\{\epsilon\}$. This step is the same for all $\{\epsilon\}$, $\{\nu\}$ and we need say nothing further about it.

B. Precede by $\nu_1 + 1$ each partition of the, supposed known, analysis of the product of $\{\nu_2, \dots, \nu_m\}$ by a linear combination of simple characteristics of the symmetric group on $n - 1$ letters. This linear combination is determined, by the method explained in § 2, from the tables of § 3 and of (1) pp. 475-477. We shall denote it by the symbol B and shall give below tables furnishing B for all partitions $(\epsilon)$ of $n \le 8$ for which $(\epsilon)$ follows, if it is not identical with, its associate $(\epsilon')$.

C. Precede by $\nu_1 + 2$ each partition of the, supposed known, analysis of the product of $\{\nu_2, \dots, \nu_m\}$ by a linear combination of simple characteristics of the symmetric group on $n - 2$ letters. We denote this linear combination by C and give it for the same partitions $(\epsilon)$ as before.

D. Same as B save that $\nu_1 + 1$ and $n - 1$ are replaced by $\nu_1 + 3$ and $n - 3$, respectively.

E. Same as B save that $\nu_1 + 1$ and $n - 1$ are replaced by $\nu_1 + 4$ and $n - 4$, respectively,

and so on.
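The linear combinations B, C, D, E can also be written down without tables. The following Python sketch rests on an assumption not made in the text: it uses the standard fact that the coefficient of z_1^p in {eps} is the sum, each with coefficient unity, of the {mu} obtained from (eps) by removing p cells, no two from the same column (a horizontal strip). The function name `combinations_BCDE` is ours.

```python
from collections import defaultdict
from itertools import product as cartesian


def combinations_BCDE(eps):
    """Map p to the partitions mu whose characteristics make up the p-th combination.

    p = 0 gives {eps} itself (step A); p = 1, 2, 3, 4 give B, C, D, E.
    """
    bounds = list(eps[1:]) + [0]
    # mu_i may range from eps_{i+1} up to eps_i.
    ranges = [range(lo, hi + 1) for hi, lo in zip(eps, bounds)]
    table = defaultdict(list)
    for mu in cartesian(*ranges):
        p = sum(eps) - sum(mu)
        table[p].append(tuple(x for x in mu if x > 0))
    return {p: sorted(mus, reverse=True) for p, mus in table.items()}
```

For $(\epsilon) = (2, 1)$ this gives B = {2} + {1^2} and C = {1}, the combinations used in Example 1 below.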


Tables furnishing the linear combinations B, C, D, E 




Illustrative examples. Since we give below the complete table analysing the direct product $\Gamma \cdot \Gamma'$ for the cases $m + n = 10$ we shall illustrate by examples for which $m + n > 10$. We arrange the notation so that $n \le m$ and if $(\epsilon)$ precedes its associate partition $(\epsilon')$ we first multiply the associated representations and then take the associates of all terms occurring in the product.


Example 1. $\{2, 1\} \cdot \{4, 2^2\}$.

We read from Table 6 (1, p. 484)

$$\{2^2\} \cdot \{2, 1\} = \{4, 3\} + \{4, 2, 1\} + \{3^2, 1\} + \{3, 2^2\} + \{3, 2, 1^2\} + \{2^3, 1\}$$

and prefix a 4 to each of the partitions of 7 which occur on the right. From Table 2 above we read B = $\{2\} + \{1^2\}$ and from Table 5 (1, p. 484) we read

$$\{2^2\} \cdot [\{2\} + \{1^2\}] = \{4, 2\} + \{3^2\} + 2\{3, 2, 1\} + \{2^3\} + \{2^2, 1^2\};$$

we then prefix a 5 to each of the partitions of 6 which occur on the right. Again from Table 2 above we read C = $\{1\}$ and from Table 4 (1, p. 483) $\{2^2\} \cdot \{1\} = \{3, 2\} + \{2^2, 1\}$; we prefix a 6 to each of the partitions of 5 which occur on the right. Hence the desired analysis of the direct product is

$$\begin{aligned}\{2, 1\} \cdot \{4, 2^2\} = {} & \{6, 3, 2\} + \{6, 2^2, 1\} + \{5, 4, 2\} + \{5, 3^2\} + 2\{5, 3, 2, 1\} \\ & + \{5, 2^3\} + \{5, 2^2, 1^2\} + \{4^2, 3\} + \{4^2, 2, 1\} + \{4, 3^2, 1\} \\ & + \{4, 3, 2^2\} + \{4, 3, 2, 1^2\} + \{4, 2^3, 1\}.\end{aligned}$$

As a check against possible errors in copying from the tables the dimensions of the representations on both sides should be calculated. Thus $\{2, 1\}$ is the characteristic of an irreducible representation, of dimension 2, of the symmetric group on 3 letters whilst $\{4, 2^2\}$ is the characteristic of an irreducible representation, of dimension 56, of the symmetric group on 8 letters. Hence (1, p. 448) $\{2, 1\} \cdot \{4, 2^2\}$ is the characteristic of a reducible representation, of dimension $2 \cdot 56 \cdot 11! \div (3!\,8!) = 2^4 \cdot 3 \cdot 5 \cdot 7 \cdot 11 = 18{,}480$, of the symmetric group on 11 letters. The dimensions of the various irreducible representations of the symmetric group on 11 letters which occur in the analysis of $\{2, 1\} \cdot \{4, 2^2\}$ given above may be calculated by means of the formula of Frobenius (1, (11), p. 460) or read off from the table (7, pp. 201-204). On removing, for convenience, the common factor 11 we obtain the check




1,680 = 90 + 100 + 90 + 60 + 420 + 75 
+ 140 + 42 + 120 + 108 + 120 + 210 + 105. 
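The check by dimensions is readily automated. The Python sketch below is illustrative only; it computes dimensions by the hook-length formula, which is a different (later) formula from that of Frobenius cited in the text but gives the same numbers. The names `dimension` and `product_dimension` are ours.

```python
from math import comb, factorial


def dimension(partition):
    """Dimension of the irreducible representation, by the hook-length formula."""
    n = sum(partition)
    columns = [sum(1 for part in partition if part > j) for j in range(partition[0])]
    hooks = 1
    for i, row in enumerate(partition):
        for j in range(row):
            # arm + leg + 1 for the cell in row i, column j
            hooks *= (row - j - 1) + (columns[j] - i - 1) + 1
    return factorial(n) // hooks


def product_dimension(eps, nu):
    """Dimension of the direct product {eps}.{nu}."""
    n, m = sum(eps), sum(nu)
    return dimension(eps) * dimension(nu) * comb(n + m, n)


# Analysis of {2,1}.{4,2^2} as found in Example 1.
analysis = {
    (6, 3, 2): 1, (6, 2, 2, 1): 1, (5, 4, 2): 1, (5, 3, 3): 1,
    (5, 3, 2, 1): 2, (5, 2, 2, 2): 1, (5, 2, 2, 1, 1): 1, (4, 4, 3): 1,
    (4, 4, 2, 1): 1, (4, 3, 3, 1): 1, (4, 3, 2, 2): 1, (4, 3, 2, 1, 1): 1,
    (4, 2, 2, 2, 1): 1,
}
lhs = product_dimension((2, 1), (4, 2, 2))
rhs = sum(c * dimension(p) for p, c in analysis.items())
```

Both `lhs` and `rhs` should equal 18,480, i.e. 11 times the sum displayed above.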


Example 2. $\{3, 1\} \cdot \{7, 2\}$.


Here $(3, 1)$ precedes its associate $(2, 1^2)$ and so we calculate $\{2, 1^2\} \cdot \{2^2, 1^5\}$. Reading the tables as explained in detail in the previous example we have first to calculate $\{2, 1^2\} \cdot \{2, 1^5\}$. To do this we read off

$$\{2, 1^2\} \cdot \{1^5\} = \{3, 2^2, 1^2\} + \{3, 2, 1^4\} + \{3, 1^6\} + \{2^3, 1^3\} + \{2^2, 1^5\} + \{2, 1^7\}.$$

On prefixing a 2 the terms arising from those partitions of 9 on the right which begin with a 3 vanish since $\{2, 3, 2^2, 1^2\} = 0$ (1, p. 461) etc. Hence the terms in the analysis of $\{2, 1^2\} \cdot \{2, 1^5\}$ which begin with a 2 are $\{2^4, 1^3\} + \{2^3, 1^5\} + \{2^2, 1^7\}$. To find the terms beginning with a 3 we read off

$$[\{2, 1\} + \{1^3\}] \cdot \{1^5\} = \{3, 2, 1^3\} + \{3, 1^5\} + \{2^3, 1^2\} + 2\{2^2, 1^4\} + 2\{2, 1^6\} + \{1^8\}$$

so that the terms beginning with a 3 are

$$\{3^2, 2, 1^3\} + \{3^2, 1^5\} + \{3, 2^3, 1^2\} + 2\{3, 2^2, 1^4\} + 2\{3, 2, 1^6\} + \{3, 1^8\}.$$

The terms beginning with a 4 are found from

$$\{1^2\} \cdot \{1^5\} = \{2^2, 1^3\} + \{2, 1^5\} + \{1^7\}$$

and are

$$\{4, 2^2, 1^3\} + \{4, 2, 1^5\} + \{4, 1^7\}.$$

Hence we have

$$\begin{aligned}\{2, 1^2\} \cdot \{2, 1^5\} = {} & \{4, 2^2, 1^3\} + \{4, 2, 1^5\} + \{4, 1^7\} + \{3^2, 2, 1^3\} + \{3^2, 1^5\} \\ & + \{3, 2^3, 1^2\} + 2\{3, 2^2, 1^4\} + 2\{3, 2, 1^6\} + \{3, 1^8\} \\ & + \{2^4, 1^3\} + \{2^3, 1^5\} + \{2^2, 1^7\}\end{aligned}$$

(a preliminary calculation which would have been unnecessary if we had already prepared the table for $n + m = 11$). The terms beginning with a 2 in the analysis of $\{2, 1^2\} \cdot \{2^2, 1^5\}$ are accordingly

$$\begin{aligned}& \{2, 4, 2^2, 1^3\} + \{2, 4, 2, 1^5\} + \{2, 4, 1^7\} + \{2^5, 1^3\} + \{2^4, 1^5\} + \{2^3, 1^7\} \\ & \quad = -\{3^2, 2^2, 1^3\} - \{3^2, 2, 1^5\} - \{3^2, 1^7\} + \{2^5, 1^3\} + \{2^4, 1^5\} + \{2^3, 1^7\}\end{aligned}$$




(the terms beginning with 3 in the analysis of $\{2, 1^2\} \cdot \{2, 1^5\}$ vanishing when the 2 is prefixed). To find the terms beginning with 3 we read off, from the table below, $[\{2, 1\} + \{1^3\}] \cdot \{2, 1^5\}$ obtaining, on prefixing a 3,

$$\{3^3, 1^4\} + 2\{3^2, 2^2, 1^3\} + 3\{3^2, 2, 1^5\} + 2\{3^2, 1^7\} + \{3, 2^4, 1^2\} + 2\{3, 2^3, 1^4\} + 2\{3, 2^2, 1^6\} + \{3, 2, 1^8\}.$$

To find the terms beginning with 4 we read off $\{1^2\} \cdot \{2, 1^5\}$ and obtain, on prefixing a 4,

$$\{4, 3, 2, 1^4\} + \{4, 2^3, 1^3\} + \{4, 3, 1^6\} + \{4, 2^2, 1^5\} + \{4, 2, 1^7\}.$$

Collecting our results we find

$$\begin{aligned}\{2, 1^2\} \cdot \{2^2, 1^5\} = {} & \{4, 3, 2, 1^4\} + \{4, 2^3, 1^3\} + \{4, 3, 1^6\} + \{4, 2^2, 1^5\} \\ & + \{4, 2, 1^7\} + \{3^3, 1^4\} + \{3^2, 2^2, 1^3\} + 2\{3^2, 2, 1^5\} \\ & + \{3^2, 1^7\} + \{3, 2^4, 1^2\} + 2\{3, 2^3, 1^4\} + 2\{3, 2^2, 1^6\} \\ & + \{3, 2, 1^8\} + \{2^5, 1^3\} + \{2^4, 1^5\} + \{2^3, 1^7\}.\end{aligned}$$

On taking the associates of the representations occurring in this analysis we obtain

$$\begin{aligned}\{3, 1\} \cdot \{7, 2\} = {} & \{10, 3\} + \{10, 2, 1\} + \{9, 4\} + 2\{9, 3, 1\} + \{9, 2^2\} \\ & + \{9, 2, 1^2\} + \{8, 5\} + 2\{8, 4, 1\} + 2\{8, 3, 2\} + \{8, 3, 1^2\} \\ & + \{8, 2^2, 1\} + \{7, 5, 1\} + \{7, 4, 2\} + \{7, 4, 1^2\} \\ & + \{7, 3^2\} + \{7, 3, 2, 1\}.\end{aligned}$$

On dividing out by the common factor 13 we have the check by dimensions:

4455 = 16 + 33 + 33 + 210 + 72 + 110 + 44 + 396
+ 528 + 324 + 280 + 220 + 462 + 528 + 275 + 924.


Example 3. $\{2^2, 1\} \cdot \{4, 3, 1\}$.


This is the example, referred to in the introduction, of which the analysis was wrongly printed in (2). We read off

$$\begin{aligned}\{2^2, 1\} \cdot \{3, 1\} = {} & \{5, 3, 1\} + \{5, 2^2\} + \{5, 2, 1^2\} + \{4, 3, 2\} + \{4, 3, 1^2\} \\ & + 2\{4, 2^2, 1\} + \{4, 2, 1^3\} + \{3^2, 2, 1\} + \{3, 2^3\} + \{3, 2^2, 1^2\}\end{aligned}$$

so that the terms in the desired analysis beginning with 4 are

$$\{4^2, 3, 2\} + \{4^2, 3, 1^2\} + 2\{4^2, 2^2, 1\} + \{4^2, 2, 1^3\} + \{4, 3^2, 2, 1\} + \{4, 3, 2^3\} + \{4, 3, 2^2, 1^2\}.$$




Also we read off

$$\begin{aligned}[\{2^2\} + \{2, 1^2\}] \cdot \{3, 1\} = {} & \{5, 3\} + 2\{5, 2, 1\} + \{5, 1^3\} + 2\{4, 3, 1\} \\ & + 2\{4, 2^2\} + 3\{4, 2, 1^2\} + \{4, 1^4\} + \{3^2, 2\} \\ & + \{3^2, 1^2\} + 2\{3, 2^2, 1\} + \{3, 2, 1^3\}\end{aligned}$$

and the terms in the desired analysis beginning with 5 are obtained by prefixing a 5 to the partitions of 8 which appear on the right-hand side. Finally

$$\{2, 1\} \cdot \{3, 1\} = \{5, 2\} + \{5, 1^2\} + \{4, 3\} + 2\{4, 2, 1\} + \{4, 1^3\} + \{3^2, 1\} + \{3, 2^2\} + \{3, 2, 1^2\}$$

and the terms in the desired analysis beginning with 6 are found by prefixing a 6 to the partitions of 7 which appear on the right-hand side. Collecting terms we obtain

$$\begin{aligned}\{2^2, 1\} \cdot \{4, 3, 1\} = {} & \{6, 5, 2\} + \{6, 5, 1^2\} + \{6, 4, 3\} + 2\{6, 4, 2, 1\} \\ & + \{6, 4, 1^3\} + \{6, 3^2, 1\} + \{6, 3, 2^2\} + \{6, 3, 2, 1^2\} \\ & + \{5^2, 3\} + 2\{5^2, 2, 1\} + \{5^2, 1^3\} + 2\{5, 4, 3, 1\} \\ & + 2\{5, 4, 2^2\} + 3\{5, 4, 2, 1^2\} + \{5, 4, 1^4\} + \{5, 3^2, 2\} \\ & + \{5, 3^2, 1^2\} + 2\{5, 3, 2^2, 1\} + \{5, 3, 2, 1^3\} + \{4^2, 3, 2\} \\ & + \{4^2, 3, 1^2\} + 2\{4^2, 2^2, 1\} + \{4^2, 2, 1^3\} + \{4, 3^2, 2, 1\} \\ & + \{4, 3, 2^3\} + \{4, 3, 2^2, 1^2\}.\end{aligned}$$

On dividing out by the common factor $13 \times 11$ we obtain the check by dimensions

3150 = 36 + 40 + 45 + 240 + 72 + 80 + 84 + 144 + 24 + 120 + 35
+ 210 + 180 + 450 + 63 + 81 + 112 + 300 + 144 + 60 + 81
+ 180 + 84 + 105 + 60 + 120.


Example 4. $\{2^4\} \cdot \{4^2\}$.

For the first set of terms we have to evaluate $\{2^4\} \cdot \{4\}$ and for this we calculate its associate:

$$\{1^4\} \cdot \{4^2\} = \{5^2, 1^2\} + \{5, 4, 1^3\} + \{4^2, 1^4\}$$

so that

$$\{2^4\} \cdot \{4\} = \{6, 2^3\} + \{5, 2^3, 1\} + \{4, 2^4\}.$$

The first set of terms follows by prefixing a 4 and is $-\{5^2, 2^3\} + \{4^2, 2^4\}$. The second set of terms requires the evaluation of $\{2^3, 1\} \cdot \{4\}$; the associate of this is

$$\{1^4\} \cdot \{4, 3\} = \{5, 4, 1^2\} + \{5, 3, 1^3\} + \{4^2, 1^3\} + \{4, 3, 1^4\}$$

so that

$$\{2^3, 1\} \cdot \{4\} = \{6, 2^2, 1\} + \{5, 2^3\} + \{5, 2^2, 1^2\} + \{4, 2^3, 1\}.$$




Hence the second group of terms is

$$\{5^2, 2^3\} + \{5^2, 2^2, 1^2\} + \{5, 4, 2^3, 1\}.$$

The third and last set of terms requires the product

$$\{2^3\} \cdot \{4\} = \{6, 2^2\} + \{5, 2^2, 1\} + \{4, 2^3\}$$

and is

$$\{6^2, 2^2\} + \{6, 5, 2^2, 1\} + \{6, 4, 2^3\}.$$

Collecting we have

$$\{2^4\} \cdot \{4^2\} = \{6^2, 2^2\} + \{6, 5, 2^2, 1\} + \{6, 4, 2^3\} + \{5^2, 2^2, 1^2\} + \{5, 4, 2^3, 1\} + \{4^2, 2^4\}.$$

A partial check on the accuracy of the analysis is furnished by the fact that the six representations occurring on the right consist of three pairs of associated representations so that the direct product $\Gamma_{(2^4)} \cdot \Gamma_{(4^2)}$ is self-associated, as it must be. On removing the common factor $13 \cdot 11 \cdot 7 \cdot 5 \cdot 3 \cdot 2$ we obtain the check by dimensions $84 = 5 + 21 + 16 + 16 + 21 + 5$. The direct product being analysed in this example is a representation, of dimension $13 \cdot 11 \cdot 7^2 \cdot 5 \cdot 3^2 \cdot 2^3 = 2{,}522{,}520$, of the symmetric group on 16 letters.

5. Table furnishing the analysis of $\Gamma \cdot \Gamma'$ for $n + m = 10$. (For an explanation of how this table is read see 1, p. 482.)


(9).(1)) | | | (19).(1) 
(8,1).(1)| | 4] 1] (2,17).(1) 
(7,2).(1) 1} | 1) 1 (22,15).(1) 
(7,12).(1) | (3,16).(1 
(6,3).(1) | 1 (23,18).(1) 
(6,2,1).(1) 1} 1] 1 | (3,2,14).(1) 
(6,12).(1) 1) 1 (4,15).(1) 
(5,3,1).(1) 1 1} 1) 1 (3,22,12).(1) 
(5,22).(1) 1 1} | 1 (32,15).(1) 
(5,2,12).(1) 1 | | (4,2, 13).(1) 
(5,14).(1) | | 1 1} 1 (5,14).(1) 
(42,1).(1) | | 1 1} 1 (3,23).(1) 
(4,3,2).(1) | 1 1} | (3?,2,1).(1) 
(4,3,12).(1) 1 1} | 1) 1) |(4,22,1).() 
(33).(1) | 1 1|(33).(1) 
(8).(2)| 1) 1) 1) | | (18).(1?) 
(7,1).(2) 1) 1} 1) 1) 1 (2,16).(1?) 
(6,2).(2) 1 1} 1 1} 1) 1 (2?,14).(1?) 
(6,12). (2) 1 1 (3,15).(12) 
(5,3).(2) | | 1 1] 1) 1 (23,12).(12) 
(5,2,1).(2) 1} 1} 1 1} 1} 1} 1 (3,2,13).(1?) 
(5,13). (2) 1} 1 1 (4,14).(1?) 
(42).(2) | | 1 1 1 (24).(12) 
(4,3,1).(2) | | 1 1} 1) 1 1} 1] 1) 1 (3,22,1).(12) 




(6,2,1?) 
(6,14) 
(5,3, 1?) 


3 
(7,2,1) 
(7,18) 
(6,4) 
(6,3,1) 
(5?) 


(10) 
(9,1) 
(8,1?) 
( 
(5,15) 


(8,2) 


7, 


(6,22) 


(5,4,1) 
(5,3,2) 


(4,22). (2) 


(5,22, 1) 



(11°) 
(2,18) 
(2?,18) 
(3,1") 
(28,14) 
(3,2,15) 
(4,16) 
(24,1?) 
(3,2?, 18) 
(5,15) 
(2°) 
(3,28,1) 
(3?,2,12) 
(4,3,18) 
(6,14) 


(4,2,12). (2) 1 | 1 1 | 
(4,14).(2) 1 1/1] 
(3?,2).(2) 1 
(32,12). (2) 1 
(3,28, 1).(2) | 
(3,2,18).(2) 
(3,15).(2) 
(24). (2) | 3 
(28,12).(2) 
(2?,14).(2) | | 
(2,18).(2) 
(18).(2) | 
(7).(3)} 1 | 1 41 1 
(6,1).(3) | | 
(5,2).(3) 1 1/1 | 
(5,1?).(3) 1 1 1 
(4,3).(3) 1 
(4,2,1).(3) 1 BE 
(4,13).(3) 1 1/1 1 1 
(32,1).(3) 1 1 1 | l 
(3,2?).(3) 1 1 1 ia 
(3,2,1?).(3) 1 
(3,14).(3) 1 1/1} 
(28,1).(3) 1 
(22,18).(3) 1 
(2,15).(3) 1 | 
(17).(3) 
(7).(2,1) 1/1/1 | 
(6,1).(2,1) 1/1 | 
(5,2).(2,1) 
(5,12).(2,1) | 
(4,3).(2,1) 
(4,2,1).(2,1) 
(4,18).(2,1) 
(32,1).(2,1) 
(6).(4)| 1 | 1 
(4) 1 
(4,2).(4) 
(4,12).(4) 
(32). (4) 
(3,2,1).(4) 1 | 
| 
(3?).(3,1) 
(3,2,1).(3,1) | 
(3,13).(3,1) | 
(28).(3,1) 






(22,12).(3,1) 
(2,14).(3,1) 
(16).(3,1) 
(6).(22) 1 1 1 
(5,1).(22) 1 1 1 | 1 
(4,2). (22) 1 
(4,12).(22) 1 1 44 14-44 
(32).(22 Bet 
(3,2,1).(22) £4342 1% 
(5).(5)|} 1 | 1} 1 1 1 1 
(4,1).(5) ee ae! 1/1 1 
(3,2).(5) 1 BE 1 
(3,12).(5) 1 1/1 1 1 1 
(22,1).(5) 1 1/1 1 
(2,13).(5) 1 1/1 1 
(15).(5) 1 1 
(4,1).(4,1) Bo Se Be we 1/2/1/1 
(3,2).(4,1) 1/1 Be | 1/2/1/1 
(22,1).(4,1) 1/1/1 
(2,13).(4,1) 1 11/1/2/1 
(3,2).(3,2) 2121281111 
(3,12).(3,2) 1 1 SiR TS TR 14 
REFERENCES. 


1. F. D. Murnaghan, “On the representations of the symmetric group,” American 
Journal of Mathematics, 59 (1937), pp. 437-488. 

2. D. E. Littlewood and A. R. Richardson, “Group characters and algebra,” Philosophical
Transactions of the Royal Society of London (A), 233 (1934), pp. 99-141.

3. C. Kostka, “Über den Zusammenhang zwischen einigen Formen von symmetrischen
Funktionen,” Journal für die reine und angewandte Mathematik (Crelle),
93 (1882), pp. 89-123.

4. ———, “Tafeln und Formeln für symmetrische Funktionen,” Jahrbuch der Deutschen
math. Ver., 16 (1907), pp. 429-451.

5. ———, “Tafeln für symmetrische Funktionen bis zur elften Dimension mit kurzen
Erläuterungen,” Prog. (5) kgl. Gym. u. Realgymn. Insterburg (1908).

6. I. Schur, “Über die rationalen Darstellungen der allgemeinen linearen Gruppe,”
Berliner Berichte (1927), pp. 58-75.

7. M. Zia-ud-Din, “The characters of the symmetric group of order 11!,” Proceedings
of the London Mathematical Society (2), 39 (1935), pp. 200-204.

8. J. R. Roe, “Interfunctional expressibility tables of symmetric functions,” Syracuse
University (1931). This privately published collection of tables contains
(Plate 1) Kostka’s table for n = 10.



THE JOHNS HOPKINS UNIVERSITY.




ON A CLASS OF ARITHMETICAL FOURIER SERIES.* 


By PHILIP HARTMAN.


Let 


n=1 ax 


There have been several papers justifying this process for particular sequences 
{cn}; see, for example, Landau?’ for cp = Chowla? for = n%, a < 0; 
Chowla and Walfisz * for c, = 1; Hartman and Wintner * for cn = n%, a < }; 
Davenport for ¢n p(n), for =A(n), for —=A(n), and, finally, for 
Cn? = p(n) and all other cn = 0. In most of the above examples, (2) is shown 
to hold almost everywhere, but (2) is valid for all x in the cases treated by 
Landau,¹ Chowla,² and the last example of Davenport.⁵

A more general problem is treated in this paper. Let f(x) be a function
of period 1 which can be represented almost everywhere by a convergent
trigonometrical series, say


(3)    $f(x) = \tfrac{1}{2} a_0 + \sum_{k=1}^{\infty} (a_k \cos 2\pi kx + b_k \sin 2\pi kx).$


Conditions on f(x) and on sequences $\{c_n\}$ will be investigated under which
the following identity


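(4)    $\sum_{n=1}^{\infty} c_n f(nx) = \tfrac{1}{2} a_0 \sum_{n=1}^{\infty} c_n + \sum_{k=1}^{\infty} \Bigl[ \Bigl( \sum_{d \mid k} c_d a_{k/d} \Bigr) \cos 2\pi kx + \Bigl( \sum_{d \mid k} c_d b_{k/d} \Bigr) \sin 2\pi kx \Bigr]$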


* Received July 28, 1937. 

¹ E. Landau, “Konvergenzbeweis einer Lerchschen Reihe,” Mémoires de la Société
Royale des Sciences de Bohême, Classe des Sciences, (1919), IV.

² S. D. Chowla, “Some problems in diophantine approximation (I),” Mathematische
Zeitschrift, vol. 33 (1931), pp. 544-563.

³ S. D. Chowla and A. Walfisz, “Über eine Riemannsche Identität,” Acta Arithmetica,
vol. 1 (1935), pp. 87-112.

⁴ P. Hartman and A. Wintner, “On certain Fourier series involving sums of
divisors,” to appear in the Acta Arithmetica.

⁵ H. Davenport, “On some infinite series involving arithmetical functions,” Quarterly
Journal of Mathematics, vol. 8 (1937), pp. 8-13.




then we have the formal identity 




is valid. Landau, Chowla, Walfisz, and Davenport did not seem to recognize 
that the problem was a Fourier series problem and obtained their results with 
the aid of diophantine approximation. Fourier series methods are employed 
here; the methods are essentially the same as those used by Wintner.⁶


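THEOREM 1. Let f(x) be a Lebesgue integrable function with the
period 1 and

(5)    $f(x) \sim \sum_{k=1}^{\infty} (a_k \cos 2\pi kx + b_k \sin 2\pi kx).$

Let α and β be two real numbers such that

(6)    $m = \min(\alpha, \beta) > \tfrac{1}{2},$

and

(7)    $a_k = O(k^{-\alpha}), \quad b_k = O(k^{-\alpha}),$

and, finally,

(8)    $c_n = O(n^{-\beta}).$

Then

(i) the partial sums of the series

(9)    $\sum_{n=1}^{\infty} c_n f(nx)$

tend in the mean ($L^q$) to a function F(x), for every $q < 1/(1-m)$ or for
every $q > 0$ according as $m < 1$ or $m \ge 1$;

(ii) F(x) possesses the Fourier series

(10)    $F(x) \sim \sum_{k=1}^{\infty} \Bigl[ \Bigl( \sum_{d \mid k} c_d a_{k/d} \Bigr) \cos 2\pi kx + \Bigl( \sum_{d \mid k} c_d b_{k/d} \Bigr) \sin 2\pi kx \Bigr];$

(iii) the Fourier series of F(x) converges for almost all x to F(x),

(11)    $F(x) = \sum_{k=1}^{\infty} \Bigl[ \Bigl( \sum_{d \mid k} c_d a_{k/d} \Bigr) \cos 2\pi kx + \Bigl( \sum_{d \mid k} c_d b_{k/d} \Bigr) \sin 2\pi kx \Bigr].$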
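
The divisor sums in (10) come from a purely formal rearrangement, sketched here for the cosine terms only. The interchange of summations is taken for granted in this sketch; justifying it is what the theorem is about. The index j is introduced only for this illustration.

```latex
\sum_{n=1}^{\infty} c_n \sum_{j=1}^{\infty} a_j \cos 2\pi j n x
  \;=\; \sum_{k=1}^{\infty} \Bigl( \sum_{n j = k} c_n\, a_j \Bigr) \cos 2\pi k x
  \;=\; \sum_{k=1}^{\infty} \Bigl( \sum_{d \mid k} c_d\, a_{k/d} \Bigr) \cos 2\pi k x .
```

The sine terms are treated in the same way with $b_j$ in place of $a_j$.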
It follows from the formal identity (4) that it was necessary to suppose
$a_0 = 0$, unless $\sum_{n=1}^{\infty} c_n$ is a convergent series. The conditions imposed on f(x)
allow several immediate conclusions to be drawn. Suppose, first, that α < 1
and let p be a number such that

$2 > p > 1/\alpha,$


⁶ A. Wintner, “On a trigonometrical series of Riemann,” American Journal of
Mathematics, vol. 59 (1937), pp. 629-634.


so that

$\sum_{k=1}^{\infty} ( |a_k|^p + |b_k|^p ) < \infty,$

then by the Hausdorff⁷ extension of the Fischer-Riesz theorem, f(x) belongs
to the class $L^{p'}$, $p' = p/(p-1) < 1/(1-\alpha)$. By the same arguments, it
follows that if $\alpha \ge 1$, f(x) belongs to $L^q$ for all q > 0. Thus, in any case,
f(x) belongs to $L^2$. Also there exists a positive number δ such that

$2\alpha - \delta > 1,$

hence

$\sum_{k=1}^{\infty} (a_k^2 + b_k^2)\, k^{\delta} < \infty;$

this implies that the Fourier series (5) is convergent almost everywhere to f(x),

(12)    $f(x) = \sum_{k=1}^{\infty} (a_k \cos 2\pi kx + b_k \sin 2\pi kx).$
k=1 
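Proof of (i). Suppose first that m < 1; it is to be proved that if

(13)    $F_N(x) = \sum_{n=1}^{N} c_n f(nx),$

then there exists a function F(x) such that

(14)    $\int_0^1 | F(x) - F_N(x) |^q \, dx \to 0$ as $N \to \infty$ if $q < 1/(1-m)$.

Now it is clear that $F_N(x)$ is integrable and

(15)    $F_N(x) \sim \sum_{k=1}^{\infty} \Bigl[ \Bigl( \sum_{d \mid k,\ d \le N} c_d a_{k/d} \Bigr) \cos 2\pi kx + \Bigl( \sum_{d \mid k,\ d \le N} c_d b_{k/d} \Bigr) \sin 2\pi kx \Bigr].$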
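
Relation (15) can be checked numerically when f is a trigonometric polynomial, since then both sides are finite sums. The following is a minimal sketch; the coefficient values, the choice $c_n = 1/n$ and all function names are hypothetical and serve only as an illustration.

```python
import math

def divisor_coefficients(c, a, N, K):
    """Return {k: sum of c[d] * a[k // d] over divisors d of k with d <= N}, k = 1..K."""
    return {
        k: sum(c.get(d, 0.0) * a.get(k // d, 0.0)
               for d in range(1, min(N, k) + 1) if k % d == 0)
        for k in range(1, K + 1)
    }

# Hypothetical data: f has only three harmonics, and c_n = 1/n.
a = {1: 1.0, 2: 0.5, 3: 0.25}
b = {1: -1.0, 2: 0.3}
N = 4
c = {n: 1.0 / n for n in range(1, N + 1)}

def f(x):
    return sum(a.get(k, 0.0) * math.cos(2 * math.pi * k * x)
               + b.get(k, 0.0) * math.sin(2 * math.pi * k * x)
               for k in range(1, 4))

def F_direct(x):
    """F_N(x) as defined in (13)."""
    return sum(c[n] * f(n * x) for n in range(1, N + 1))

K = 3 * N  # the highest frequency that can occur in F_N
A = divisor_coefficients(c, a, N, K)
B = divisor_coefficients(c, b, N, K)

def F_series(x):
    """F_N(x) summed from the coefficients in (15)."""
    return sum(A[k] * math.cos(2 * math.pi * k * x)
               + B[k] * math.sin(2 * math.pi * k * x)
               for k in range(1, K + 1))

max_gap = max(abs(F_direct(j / 97) - F_series(j / 97)) for j in range(97))
print(max_gap)
```

The two evaluations should agree up to rounding error, which is all that (15) asserts in this finite case.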


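Let M > N, then it follows from the Hausdorff⁷ extension of the Parseval
relation that

(16)    $\Bigl( \int_0^1 | F_M(x) - F_N(x) |^q \, dx \Bigr)^{1/(q-1)} \le \sum_{k=1}^{\infty} \Bigl| \sum_{d \mid k,\ N < d \le M} c_d a_{k/d} \Bigr|^{q/(q-1)} + \sum_{k=1}^{\infty} \Bigl| \sum_{d \mid k,\ N < d \le M} c_d b_{k/d} \Bigr|^{q/(q-1)}$

provided that $q \ge 2$, as one may suppose. Thus, if $q \ge 2$,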


⁷ F. Hausdorff, “Eine Ausdehnung des Parsevalschen Satzes über Fourierreihen,”
Mathematische Zeitschrift, vol. 16 (1923), pp. 163-169.

(17)    $\Bigl( \int_0^1 | F_M(x) - F_N(x) |^q \, dx \Bigr)^{1/(q-1)} \le \sum_{k=N}^{\infty} \Bigl( \sum_{d \mid k} | c_d a_{k/d} | \Bigr)^{q/(q-1)} + \sum_{k=N}^{\infty} \Bigl( \sum_{d \mid k} | c_d b_{k/d} | \Bigr)^{q/(q-1)}.$


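Now from (7) and (8)

(18)    $\sum_{d \mid k} | c_d a_{k/d} | = O\bigl( k^{-\alpha} \sigma_{\alpha-\beta}(k) \bigr),$

where

(19)    $\sigma_{\gamma}(k) = \sum_{d \mid k} d^{\gamma}.$

It is known⁸ that if $\gamma \ge 0$, then

(20)    $\sigma_{\gamma}(k) = O(k^{\gamma+\epsilon}),$

and if $\gamma \le 0$, then

(21)    $\sigma_{\gamma}(k) = O(k^{\epsilon}),$

where ε > 0 is arbitrary. Thus from (6), (18), (20) and (21), it follows that

(22)    $\sum_{d \mid k} | c_d a_{k/d} | = O(k^{-m+\epsilon}),$

and similarly

(23)    $\sum_{d \mid k} | c_d b_{k/d} | = O(k^{-m+\epsilon}).$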
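
The step from (7) and (8) to (18) rests on an exact identity for the majorant: the sum over divisors d of k of $d^{-\beta} (k/d)^{-\alpha}$ equals $k^{-\alpha} \sigma_{\alpha-\beta}(k)$. The sketch below checks that identity numerically; the exponents and the values of k are hypothetical choices for illustration.

```python
def sigma(gamma, k):
    """Divisor function of (19): the sum of d**gamma over the divisors d of k."""
    return sum(d ** gamma for d in range(1, k + 1) if k % d == 0)

def weighted_divisor_sum(alpha, beta, k):
    """Sum over d | k of d**(-beta) * (k/d)**(-alpha), the majorant behind (18)."""
    return sum(d ** (-beta) * (k // d) ** (-alpha)
               for d in range(1, k + 1) if k % d == 0)

alpha, beta = 0.9, 0.7  # hypothetical exponents with min(alpha, beta) > 1/2
checks = []
for k in (12, 360, 5040):
    lhs = weighted_divisor_sum(alpha, beta, k)
    rhs = k ** (-alpha) * sigma(alpha - beta, k)
    checks.append((k, lhs, rhs))
    print(k, lhs, rhs)
```

For each k the two printed values should coincide up to rounding, so the growth of the left side is governed entirely by $\sigma_{\alpha-\beta}(k)$, which is where (20) and (21) enter.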


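Hence if $q < 1/(1-m)$, then $q/(q-1) > 1/m$, so that the series

(24)    $\sum_{k=1}^{\infty} \Bigl[ \Bigl( \sum_{d \mid k} | c_d a_{k/d} | \Bigr)^{q/(q-1)} + \Bigl( \sum_{d \mid k} | c_d b_{k/d} | \Bigr)^{q/(q-1)} \Bigr]$

is convergent. Thus, from (17),

(25)    $\int_0^1 | F_M(x) - F_N(x) |^q \, dx \to 0$ as $M, N \to \infty$

if $q < 1/(1-m)$. Thus, for the case m < 1, (i) now follows from (25)
and the completeness of the space $L^q$. The case where $m \ge 1$ follows similarly.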


Proof of (ii). This follows at once from (i), for if $a_k^N$, $b_k^N$ are the
k-th Fourier coefficients of $F_N(x)$, then it follows from (14) that

(26)    $\lim_{N \to \infty} a_k^N = A_k$ and $\lim_{N \to \infty} b_k^N = B_k$

exist and are the k-th Fourier coefficients of F(x). Now from (15),

(27)    $a_k^N = \sum_{d \mid k,\ d \le N} c_d a_{k/d}$ and $b_k^N = \sum_{d \mid k,\ d \le N} c_d b_{k/d},$


⁸ T. H. Gronwall, “Some asymptotic expressions in the theory of numbers,” Transactions
of the American Mathematical Society, vol. 14 (1913), pp. 113-122.

so that (10) follows from (26) and (27).

Proof of (iii). From (10), (22), and (23), it follows that

(28)    $A_k = O(k^{-m+\epsilon})$ and $B_k = O(k^{-m+\epsilon}),$

so that there exists a δ > 0 such that

(29)    $\sum_{k=1}^{\infty} (A_k^2 + B_k^2)\, k^{\delta} < \infty,$

for ε > 0 and δ > 0 can be chosen so that

$2m - 2\epsilon - \delta > 1.$

That the equality (11) is valid almost everywhere is a consequence of (10)
and (29).

Next, there will be proved⁹


THEOREM 2. If f(x) satisfies the conditions of Theorem 1 when the
inequality (6) is replaced by

(30)    $m > 2/3$

and if f(x) is bounded, then the series (9) converges almost everywhere to F(x).

Proof. Let p be a number satisfying

(31)    $0 < p < 3,$

then, in virtue of (30),

(32)    $p(1-m) < 1.$

Also there exists an ε > 0 and a number p satisfying (31), and a fortiori (32),
such that

(33)    $p(2m - 1 - \epsilon) > 1.$


⁹ This theorem and a similar proof for the case where f(x) is the function (1) and
where $c_n = n^{-\alpha}$, $\alpha > 2/3$, was also known to Professor Walfisz (unpublished), as I
understood after this paper was completed. Incidentally, a similar theorem was proved by
F. Jerosch and H. Weyl, “Über die Konvergenz von Reihen, die nach periodischen
Funktionen fortschreiten,” Mathematische Annalen, vol. 66 (1909), pp. 67-80.

(Cf., also S. Chowla, “On some infinite series involving arithmetical functions”
I, II, Proceedings of the Indian Academy of Sciences, Section A, vol. 5 (1937), pp. 511-516.
These references were obtained from the Zentralblatt für Mathematik, vol. 17
(1937), p. 5 as the above journal was not available. Added Nov. 18, 1937.)


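In view of (10) and (15)

$\int_0^1 | F(x) - F_N(x) |^2 \, dx \le \sum_{k=1}^{\infty} \Bigl[ \Bigl( \sum_{d \mid k,\ d > N} | c_d a_{k/d} | \Bigr)^2 + \Bigl( \sum_{d \mid k,\ d > N} | c_d b_{k/d} | \Bigr)^2 \Bigr],$

so that from (22) and (23),

(34)    $\int_0^1 | F(x) - F_N(x) |^2 \, dx = O\Bigl( \sum_{k=N+1}^{\infty} k^{-2m+\epsilon} \Bigr) = O(N^{1-2m+\epsilon}).$

It is seen that if p satisfies (33), then

$\sum_{N=1}^{\infty} \int_0^1 | F(x) - F_{[N^p]}(x) |^2 \, dx$

is a convergent series; thus, it follows from Fatou’s lemma that

(35)    $\sum_{N=1}^{\infty} | F(x) - F_{[N^p]}(x) |^2$

is convergent almost everywhere. Hence for almost all x

(36)    $F_{[N^p]}(x) \to F(x)$ as $N \to \infty$ if p satisfies (33).

Now let j be an integer such that

(37)    $[N^p] \le j < [(N+1)^p],$

then

$F_j(x) - F_{[N^p]}(x) = \sum_{n=[N^p]+1}^{j} c_n f(nx) = O\Bigl( \sum_{n=[N^p]}^{[(N+1)^p]} n^{-m} \Bigr),$

or

(38)    $F_j(x) - F_{[N^p]}(x) = O(N^{p(1-m)-1}).$

Hence, if j satisfies (37),

(39)    $F_j(x) - F_{[N^p]}(x) \to 0$ as $N \to \infty$ if p satisfies (32).

Thus (36), (39) and the existence of a p satisfying (32) and (33) imply
the statement of Theorem 2.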
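From this theorem, the following result can be inferred: Let f(x) be a
bounded integrable function of period 1 such that

$f(x) \sim \tfrac{1}{2} a_0 + \sum_{k=1}^{\infty} (a_k \cos 2\pi kx + b_k \sin 2\pi kx)$

and

$a_k = O(k^{-(2/3)-\epsilon}), \quad b_k = O(k^{-(2/3)-\epsilon}),$

then for almost all x

$\sum_{n=1}^{N} f(nx) - N \int_0^1 f(x)\, dx = O(N^{(2/3)+\epsilon}).$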
n=1 0 




The theorems just proved imply all of the earlier results mentioned at the
beginning of the paper except those which replaced “almost all x” by “all x.”
To obtain such results, it is clear that the conditions on the sequence $\{c_n\}$
must be such as to insure the convergence of the series (9) everywhere, thus
defining F(x) by an everywhere convergent series instead of using a definition
which leaves the function undetermined on a zero set, as in Theorem 1 or 2.
It is also clear that f(x) will have to satisfy more stringent conditions than
those above. In this connection, one has


THEOREM 3. Let the series

(40)    $\sum_{n=1}^{\infty} c_n$

be absolutely convergent. Let f(x) be a bounded integrable function of period
1 such that

(41)    $g(u, x) = \frac{1}{u} \int_0^u [ f(x+t) + f(x-t) - 2f(x) ] \, dt, \quad u > 0,$

is of bounded variation in any u-interval to the right of u = 0 (when considered
as a function of u) for all x and such that

(42)    $g(u, x) \to 0, \quad u \to 0$

for all x. Let $V_n(t, x)$ denote the total variation of g(nu, nx) in the interval
0 < u < t and suppose that

(43)    $\sum_{n=1}^{\infty} | c_n | \, V_n(t, x)$

is convergent for some t = t(x) > 0. The identity

(44)    $\sum_{n=1}^{\infty} c_n f(nx) = \tfrac{1}{2} a_0 \sum_{n=1}^{\infty} c_n + \sum_{k=1}^{\infty} \Bigl[ \Bigl( \sum_{d \mid k} c_d a_{k/d} \Bigr) \cos 2\pi kx + \Bigl( \sum_{d \mid k} c_d b_{k/d} \Bigr) \sin 2\pi kx \Bigr]$

is then valid for all x.


Proof. This theorem is merely a rewriting of the de la Vallée Poussin
test for the convergence of Fourier series for the case at hand. In fact, the
conditions that g(u, x) be of bounded variation in some interval 0 < u < a
and that g(u, x) → 0 as u → 0 are precisely the conditions required by the
de la Vallée Poussin test for the convergence of the Fourier series of f(x)
at the point x. Thus the conditions of the theorem imply that (3) is valid
for all x.

That




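(45)    $F(x) = \sum_{n=1}^{\infty} c_n f(nx) \sim \tfrac{1}{2} a_0 \sum_{n=1}^{\infty} c_n + \sum_{k=1}^{\infty} \Bigl[ \Bigl( \sum_{d \mid k} c_d a_{k/d} \Bigr) \cos 2\pi kx + \Bigl( \sum_{d \mid k} c_d b_{k/d} \Bigr) \sin 2\pi kx \Bigr]$

is an immediate consequence of the uniform convergence of the series (9),
which, in turn, follows from the boundedness of f(x) and the absolute convergence
of the series (40).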

One obtains from (41) that

$g(nu, nx) = \frac{1}{nu} \int_0^{nu} [ f(nx+t) + f(nx-t) - 2f(nx) ] \, dt.$

If the integration variable t is changed to nt, one has

(46)    $g(nu, nx) = \frac{1}{u} \int_0^{u} [ f(nx+nt) + f(nx-nt) - 2f(nx) ] \, dt.$

Now put

(47)    $G(u, x) = \frac{1}{u} \int_0^{u} [ F(x+t) + F(x-t) - 2F(x) ] \, dt;$

then it is clear from (46) that

(48)    $G(u, x) = \sum_{n=1}^{\infty} c_n \, g(nu, nx).$

If t = t(x) > 0 is chosen so that the series (43) is convergent, G(u, x) is of
bounded variation in the interval 0 < u < t, since the variation of the sum
of functions does not exceed the sum of the variations of the functions. Also
since g(u, x) is a bounded function, the series in (48) is uniformly convergent;
thus it follows from

$g(nu, nx) \to 0, \quad u \to 0,$

that

(49)    $G(u, x) \to 0, \quad u \to 0.$

Thus the function F(x) satisfies the criterion of de la Vallée Poussin at every
point, hence (44) is valid at every point in virtue of (45). This completes
the proof.

Theorem 3 implies the very particular cases treated by Landau, Chowla,
and Davenport. Let the function f(x) of Theorem 3 be the function (1)
and let g(u, x) have the corresponding meaning. Let n and x be fixed;
suppose that x is not of the form j/2n, where j is any integer, otherwise
g(nu, nx) = 0 for all u. Let




n 
and put 
(50) a — max (2—*—=*, £2), 
nm 
Then

$\psi(nx + nt) + \psi(nx - nt) - 2\psi(nx) = 0$
if $0 < t < a_2$;

(51)    $\psi(nx + nt) + \psi(nx - nt) - 2\psi(nx) = (-1)^{\mu}$
if $r/n + a_2 < t < r/n + a_1$ (r = 0, 1, ...);

$\psi(nx + nt) + \psi(nx - nt) - 2\psi(nx) = 0$
if $r/n + a_1 < t < (r+1)/n + a_2$;

where μ = μ(x) is 0 or 1 according as $x < (2k-1)/2n$ or $x > (2k-1)/2n$.
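
The pattern in (51) can be observed in exact arithmetic. The sketch below assumes that the function (1) is the sawtooth y − [y] − 1/2; that form is an assumption made for this illustration, as are the sample values of n and x and the function names.

```python
from fractions import Fraction
from math import floor

def psi(y):
    """Sawtooth y - [y] - 1/2 (assumed form of the function (1)); y must not be an integer."""
    return y - floor(y) - Fraction(1, 2)

def second_difference(n, x, t):
    """The expression psi(nx + nt) + psi(nx - nt) - 2 psi(nx) appearing in (51)."""
    return psi(n * x + n * t) + psi(n * x - n * t) - 2 * psi(n * x)

# Hypothetical sample: n = 3, x = 1/7, so that x is not of the form j/2n,
# and no argument of psi below is an integer.
n, x = 3, Fraction(1, 7)
values = {second_difference(n, x, Fraction(k, 1000)) for k in range(1, 1000)}
print(sorted(values))
```

For this sample only two values occur, 0 and a single constant of absolute value 1, so the integrand in (46) is a step function of t; this is what makes the explicit evaluation in (52) possible.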


Hence

$g(nu, nx) = 0$
if $0 < u < a_2$;

(52)    $g(nu, nx) = (-1)^{\mu} [ r(a_1 - a_2) + u - r/n - a_2 ] / u$
if $r/n + a_2 < u < r/n + a_1$ (r = 0, 1, ...);

$g(nu, nx) = (-1)^{\mu} (r+1)(a_1 - a_2) / u$
if $r/n + a_1 < u < (r+1)/n + a_2$;


It follows from these relations, by straightforward appraisals which can be 
left to the reader, that 


(53) Va(t, rv) SClognt, it 


where C is a constant independent of n, t and x. Also, it is obvious that (42)
is satisfied. Thus, if the sequence $\{c_n\}$ is such that

(54)    $\sum_{n=1}^{\infty} | c_n | \log n$

is convergent, the identity (2) is valid for all x.


THE JOHNS HOPKINS UNIVERSITY. 




THE STRUCTURE OF LOCAL CLASS FIELD THEORY.* 


By O. F. G. SCHILLING.†


The gist of local class field theory may be sought in the characterization
of the finite abelian extensions over perfect fields whose valuation groups are
discrete and archimedean and whose residue class fields are finite Galois fields.
M. Moriya and the author of the present paper recently investigated the
generalization of this theory to infinite extensions of p-adic number fields.¹
We found that the structural theory of finite abelian extensions over such infinite
fields is in general determined by the nature of the value group and the
field of residue classes. The results obtained can be interpreted from a more
complex viewpoint as a theory of sufficient conditions which admit the development
of a local class field theory securing the same theorems as in the finite
case. Thus one is naturally led to the following problem which we wish to
investigate in this paper: Is it possible to prove structural properties of a field
which is perfect with respect to a discrete archimedean valuation if some set
of standard theorems of local class field theory is valid for the given ground
field? Or, we ask for necessary conditions of theorems in local class field
theory.

We shall see that most of the standard theorems imply that the residue
field of the given perfect field is algebraically perfect² and that it possesses for
each integer n exactly one cyclic extension of degree n. One easily observes
that the set of possible residue class fields thus described is rather extensive;
thus it is necessary to impose further conditions upon the given ground field
if one wishes to characterize the p-adic number fields we investigated in the
paper already mentioned. The simplest assumption we found was to postulate
the validity of the given set of theorems for all perfect subfields of the given
field.

In order to avoid unnecessary repetitions in our later investigations it is 
advisable to recall some known facts about the algebraic and arithmetical 


* Received September 28, 1937. 

† Johnston Scholar at The Johns Hopkins University for 1937-38.

¹ M. Moriya and O. F. G. Schilling, “Zur Klassenkörpertheorie über unendlichen
perfekten Körpern,” Journal of the Fac. of Sci. Hokk. Imp. Univ., ser. I, vol. 5 (Sapporo,
Japan) 1937.

² We shall use “algebraically perfect” as translation of the German term “vollkommen.”




theory of perfect fields. Let k be a field which is perfect with respect to a
discrete archimedean valuation p. The maximal order of all p-adic integers
in k be denoted by o, and let p = (π) be the prime ideal of o. Then the ring
o/p of residue classes is a field $\bar{k}$. In case that $\bar{k}$ is an algebraically perfect
field the structure of k is uniquely determined by that of $\bar{k}$ provided that the
characteristic $\chi(\bar{k})$ is a unit or prime element of k.³ There can be established
a one to one correspondence between the finite algebraic extensions $\bar{U}$ of $\bar{k}$ and
the unramified extensions U of k, the fields $\bar{U}$ being the residue class fields
of the fields U. In particular, if U is a normal extension with the Galois
group $\Gamma(U, k)$ then the Galois group $\Gamma(\bar{U}, \bar{k})$ of the residue class field is
equal to $\Gamma(U, k)$.

The arithmetic theory of normal division algebras D of finite rank m over
k is very simple because of the validity of Hensel’s criterion of reducibility.⁴
All quantities of D which satisfy a minimal equation with highest coefficient
one and whose coefficients lie in o form a maximal order $\mathfrak{O}$; these elements are
characterized by the property that their reduced norms are p-adic integers.
The ring $\mathfrak{O}$ contains a uniquely determined principal prime ideal $\mathfrak{P}$ whose
powers exhaust the set of all ideals with respect to D. The ring of residue
classes $\mathfrak{O}/\mathfrak{P}$ is a division algebra of finite rank f over the field o/p, f being
called the residue degree of D with respect to k.⁵ Since $\mathfrak{O}$ is a principal ring
the extension $\mathfrak{O}p = p\mathfrak{O}$ of the prime ideal p is a positive power $\mathfrak{P}^e$ of the two-sided
prime ideal $\mathfrak{P}$; the integer e is called the ramification degree of D with
respect to k. Between the two numbers e and f related to D with respect to k
the following important relation holds

$ef = m.$

Since a prime element Π of $\mathfrak{P}$ generates a ramified subfield k(Π) of D
whose ramification exponent is equal to e we find that e is a divisor of the
degree $n = \sqrt{m}$ of D. Hence

$e \le n$ and $f \ge n.$


A normal division algebra D of degree n over k can always be represented 


³ H. Hasse and F. K. Schmidt, “Die Struktur diskret bewerteter Körper,” Crelle,
vol. 170 (1933).

⁴ H. Hasse, “Über p-adische Schiefkörper und ihre Bedeutung für die Arithmetik
hyperkomplexer Zahlsysteme,” Mathematische Annalen, vol. 104 (1931).

⁵ M. Deuring, “Algebren,” Ergebnisse der Mathematik und ihrer Grenzgebiete,
Berlin 1935, Chap. VI, §§ 11, 12.

⁶ T. Nakayama, “Divisionalgebren über diskret bewerteten perfekten Körpern,”
Crelle, vol. 177, 1937.




in its class of normal algebras over k as the crossed product $(a(\sigma,\tau), U, \Gamma(U,k))$
belonging to a suitably chosen normal unramified splitting field U of D. For
an arbitrary but fixed choice of a prime element π of p this crossed product
decomposes uniquely into a cyclic ramified algebra and an unramified algebra

(*)    $D \sim (a(\sigma,\tau), U, \Gamma(U,k)) \sim (\pi^{c(\sigma,\tau)}, U, \Gamma(U,k)) \times (\eta(\sigma,\tau), U, \Gamma(U,k)),$

where $a(\sigma,\tau) = \pi^{c(\sigma,\tau)} \eta(\sigma,\tau) \epsilon(\sigma,\tau)$; $\eta(\sigma,\tau)$ belonging to a fixed set of multiplicative
representatives of the residue class field $\bar{U}$ and $\epsilon(\sigma,\tau) \equiv 1$ (mod p).
Since the exponents $c(\sigma,\tau)$ form a set of addends belonging to the Galois group
$\Gamma(U,k)$ they uniquely determine a cyclic subfield of U. It is readily seen
that the algebra $(\pi^{c(\sigma,\tau)}, U, \Gamma(U,k))$ is similar to a ramified division algebra
possessing the aforementioned cyclic subfield of U as splitting field. The
second factor in (*) represents an unramified division algebra D′ which
corresponds to a division algebra $\bar{D}'$ over $\bar{k}$; we observe that $\bar{D}'$ need not be
normal over $\bar{k}$.⁷

A simple group theoretical consideration shows that the group of all
ramified algebras $\{(\pi^{c(\sigma,\tau)}, U, \Gamma(U,k))\}$ split by a fixed normal unramified
field U over k is isomorphic with the factor group $\Gamma(U,k)/\Gamma(U,k)'$ of $\Gamma(U,k)$
with respect to its commutator group.

A normal division algebra D possesses always unramified maximally commutative
subfields U provided that the algebra of residue classes $\mathfrak{O}/\mathfrak{P}$ has a
center which is separable over the field $\bar{k}$.⁸

We begin our investigations with the proof of some lemmas which will be 
quite useful later on. 


LEMMA 1. If the ground field k and its finite algebraic extensions K
possess no cyclic unramified extensions then there do not exist proper division
algebras of finite rank over k, i.e. k is quasi-algebraically closed.⁹


Proof. Suppose that D is a proper normal division algebra of rank
$m = n^2$ over k. The algebra D possesses always separable normal splitting
fields U:

$D \times U \sim U.$


⁷ E. Witt, “Schiefkörper über diskret bewerteten Körpern,” Crelle, vol. 176 (1937).

⁸ Cf. the paper mentioned under °. In the following investigation we shall assume
that all division algebras considered possess residue algebras with separable centers
over $\bar{k}$.

⁹ A field is called quasi-algebraically closed if there exist no proper division algebras
over it.




Take such a splitting field and consider a Sylow subgroup $\mathfrak{S}_q$ belonging to a
prime divisor q of n. Let the subfield of U belonging to it be $U_q$,
$([U_q : k], q) = 1$. Since $\mathfrak{S}_q$ is a solvable group we can draw a composition


chain 


whose factors are cyclic groups of order g. The corresponding chain of fields 
may be given by 


Now consider the algebra D & U,*, it possesses the cyclic unramified field 
U =U,‘ as splitting field. Since such a field cannot exist according to our 
assumptions we see that D X U,‘*") must be similar to Ug”. Repeating 
this conclusion we observe that already 


$D \times U_q \sim U_q.$


This last relation contradicts the choice of q for a well known theorem in the 
algebraic theory of normal algebras asserts that the degree of a splitting field 
of D must be a multiple of n, and we have shown that D—if it would exist— 


possesses U, as splitting field. 


Lemma 2. If k and all its finite algebraic extensions are never centers 
of cyclic proper algebras then k is quasi-algebraically closed. 


Proof. We can apply again the same argument, here the relation
$D \times U_q \sim U_q$ is a consequence of the supposed non-existence of cyclic algebras
over $U_q$.


LEMMA 3. If all possibly existing division algebras D of degree n over k
and its finite algebraic extensions K possess isomorphic maximally commutative
subfields and if −1 is a universal quadratic norm¹⁰ in case that $\chi(k) \ne 2$
then k is quasi-algebraically closed.


Proof. Suppose that there exists a proper normal division algebra D of
degree $n = \prod q_i^{\nu_i}$ over k. The algebra D is similar to a direct product of primary
division algebras $D_i$ whose degrees are equal to $q_i^{\nu_i}$. In order to prove that

10 An element is said to be a universal quadratic norm if it is the norm of a suitable 
element in any quadratic extension of the ground field. 




our assumptions imply D ~ k it suffices to show that all primary algebras $D_i$
are similar to k. First we consider division algebras whose degrees are relatively
prime to 2.

Let $D = (a, Z_r)$ be a cyclic division algebra of degree r. Then $a^r$ is the
least power of a which is norm of an element in $Z_r$. Hence $k(a^{1/r})$ is an
algebraic extension of k of degree r; since a is a factor set the field $k(a^{1/r})$ is a
splitting field of D. Our assumption yields that $Z_r$ and $k(a^{1/r})$ are equal if
taken in one and the same algebraically closed field over k. But now the
element a is the norm of a quantity in $Z_r$ for r was supposed to be relatively
prime to 2. Consequently D has to be similar to k. Next we apply Lemma 2
which asserts that there do not exist proper division algebras over k as center
of these primary algebras. Hence all primary algebras whose degrees are
relatively prime to 2 are similar to k.

If the degree of a primary algebra is a power of 2 we have to distinguish 
two different cases : 


(i) $\chi(k) = 2$ and (ii) $\chi(k) \ne 2$.


Case (i). According to the theory of primary division algebras and their 
subalgebras it is sufficient to prove that there do not exist proper division 
algebras of quaternions over k and its finite algebraic extensions for our 
assumption is to hold over k and its extensions, and the argument using a 
Sylow subgroup belonging to the prime 2 of the Galois group of a normal 
splitting field applies here too. 

Suppose now that there exists a proper division algebra of degree 2 over
k as center. Such an algebra Q has obviously the form

$Q = (a, k(C))$

where $C^2 + C = b$, $u^2 = a$, $u^{-1} C u = C + 1$ with a, b ≠ 0 in k. Then
our assumption implies that k(u) = k(C) in one and the same algebraic
closure of k. Such an equality is impossible for k(u) is an inseparable extension
of k and k(C) is a separable extension.


Case (ii). It suffices again to show that there do not exist proper division
algebras of quaternions over k and its finite algebraic extensions. Let
$Q = (a, k(b^{1/2}))$ be such an algebra. Our assumptions imply first of all that
$k(a^{1/2}) = k(b^{1/2})$, i.e. $b = ac^2$ with c in k. Hence $Q = (a, k(a^{1/2}))$. The
norm of $a^{1/2}$ is equal to −a. But a itself is also a norm since our special
assumption, −1 = Nd with d in $k(a^{1/2})$, implies $a = (-1)(-a) = Nd \cdot N a^{1/2} = N(d\, a^{1/2})$.


Remark. If we omit in the assumptions that −1 be a universal quadratic
norm then we see that there exists exactly one division algebra over k, namely
the algebra $(-1, k((-1)^{1/2}))$. For the existence of a division algebra
$(a, k(a^{1/2}))$ implies that a is not the norm of an element in $k(a^{1/2})$ hence
−1 is not a norm and a fortiori not a square. The algebra $(-1, k(a^{1/2}))$
must be equivalent to $(a, k(a^{1/2}))$ as the assumption on the isomorphism of the
maximally commutative subfields implies, consequently

$(a, k(a^{1/2})) \sim (-1, k((-1)^{1/2})).$


THEOREM 1. If the field of residue classes $\bar{k}$ of the perfect field k is
algebraically perfect the three following statements are equivalent:

(i) the ramification degree e of each normal division algebra D of degree
n over k equals n,

(ii) the algebra of residue classes belonging to each normal division
algebra D over k is a commutative field, and

(iii) the maximally commutative unramified subfields of each division
algebra D over k are isomorphic and, in case of $\chi(\bar{k}) \ne 2$, −1 is a universal
quadratic norm in $\bar{k}$.


Proof. If ef —vn for a division algebra D then 
U, r(U, k) ) (n(o, U, r(U, k) ) 
U, r(U, k) ) (x, Za)’, 


where Z, is a maximally commutative cyclic unramified subfield of D and 
v is relatively prime to n. Moreover, the algebra has exponent n. The maximal 
order © of D contains the maximal order O(Z,) of Zn and 


D/P DO(Zn)/B =O (Zn) /P. 
Since [D/%:k] we have 
O/B = O(Zn)/®, 


i.e. the algebra of residue classes is a commutative field. 

Conversely, let D be a division algebra over k which satisfies (ii). Since 
k was assumed to be algebraically perfect the algebra D can be represented as 
a generalized crossed product 


D= (a(a, Y)> U) 


where U is a suitable maximally commutative unramified subfield of D and 
a, 8,y are elements of the Galois group belonging to the normal field con- 
taining U. Now (ii) implies 




2 O(U)/P =O(V)/9. 


If the algebra O/9 has the degree n’ over its center K and K has the rank n’ 
over k the following relation holds between the different ranks 


f = [(O/8: k] = = vn” =n. 


Hence (i) and (ii) are equivalent statements.

Obviously (i) and (ii) imply that all maximally commutative unramified
subfields $U_n$ of division algebras D of degree n are isomorphic. Hence $\bar{k}$ is
quasi-algebraically closed and −1 is a universal quadratic norm in $\bar{k}$, thus
(iii) holds. Conversely, (iii) implies according to Lemma 3 that $\bar{k}$ is
quasi-algebraically closed. But this fact implies e = f = n. Thus (i) and (iii)
are equivalent.

Now let k be a perfect field for which any one of the statements of
Theorem 1 holds. We wish to show by an example that such an assumption
does not necessarily imply that the well known theorems of the local class
field theory hold.¹¹

Let $\bar{k}$ be the field of all roots of unity over the field of all rational numbers.
Then the field $k = \bar{k}\{t\}$ consisting of all formal power series $\sum_{i > -\infty}^{\infty} a_i t^i$ in one
variable t with coefficients $a_i$ in $\bar{k}$ is a perfect field. The unramified extensions
U of k are fields of formal power series in t whose coefficients are taken from
a field which is isomorphic with the residue field $\bar{U}$ of U. Let U be a normal
unramified field with the Galois group $\Gamma(U,k) = \{\sigma, \tau, \ldots\}$. Then the group
G(U) consisting of all normal algebras over k which are split by U is isomorphic
with the factor group $\Gamma(U,k)/\Gamma(U,k)'$ of the Galois group $\Gamma(U,k)$
with respect to its commutator group for each algebra in G(U) has the form

$(\pi^{c(\sigma,\tau)}, U, \Gamma(U,k)),$

where π denotes a fixed prime element of p. This is true because $\bar{k}$ is a
quasi-algebraically closed field.¹² Moreover all division algebras D over k are cyclic.


Now let U be in particular a normal unramified extension of k whose Galois
group is not solvable, such fields always exist. Namely, we have only to take
a normal extension U′ over the field of all rational numbers and to form the
join $\bar{U} = U' \bar{k}$.¹³ The group of algebras belonging to such a field U is equal
to one.

¹¹ For general statements about perfect fields also in later considerations see W.
Krull, “Allgemeine Bewertungstheorie,” Crelle, vol. 167 (1931).

¹² H. Hasse, “Die Struktur der R. Brauerschen Algebrenklassengruppe über einem
algebraischen Zahlkörper,” Mathematische Annalen, vol. 107 (1933).

If $U$ is an abelian extension then $G(U)$ is isomorphic with the Galois 
group $\Gamma(U,k)$. In particular, the group $G(Z_n)$ belonging to a cyclic 
unramified extension $Z_n$ consists of a single cycle $Z(n)$ of order $n$. 

Consider now the set of all division algebras $D$ of degree and exponent $n$; 
these algebras can all be represented as cyclic crossed products 

$$(\pi, Z_n)^{\nu},$$

where the exponents $\nu$ are relatively prime to $n$. Observe that all algebras 
$(\alpha, Z_n)$ whose factor sets are units in $k$ are similar to $k$. Thus the cyclic 
ramified extension $k(\pi^{1/n})$ of degree $n$ over $k$ is the universal splitting field 
of all algebras of degree $n$: 

$$G(k(\pi^{1/n})) \supset \text{all } G(Z_n).$$


A simple argument using $\pi$-adic approximations yields that $G(k(\pi^{1/n}))$ 
is an infinite abelian group of type $(n, n, \dots)$. 

Furthermore, we see that the factor group of norm classes $k^*/NU^*$ 
belonging to an unramified abelian extension of degree $n$ is a cyclic group of 
order $n$ whose generator is representable by $t$. In order to prove this assertion 
one has only to represent $U$ as the join of a set of mutually distinct cyclic 
subfields and to determine the respective factor groups in a composition 
chain of $U$. 
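
The index computation behind this assertion can be sketched as follows. The sketch assumes (in notation that is ours, not necessarily the author's) that $\pi$ is a prime element of $k$ which remains prime in the unramified extension $U$ of degree $n$, that $\epsilon$ and $E$ denote the groups of units of $k$ and $U$, and that $N$ is the norm from $U$ to $k$.

```latex
\begin{align*}
k^* &= \{\pi\} \times \epsilon, \qquad U^* = \{\pi\} \times E, \\
N(\pi^{a}\eta) &= \pi^{na}\,N(\eta) \qquad (a \in \mathbb{Z},\ \eta \in E), \\
NU^* &= \{\pi\}^{n} \times NE, \\
[k^*/NU^*:1] &= [\{\pi\}/\{\pi\}^{n}:1]\,[\epsilon/NE:1] = n\,[\epsilon/NE:1].
\end{align*}
```

Under these assumptions the norm class group is cyclic of order $n$, generated by the class of $\pi$, exactly when every unit of $k$ is the norm of a unit of $U$.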

Combining these results we observe 


(i) the group of classes of algebras possessing degree and exponent $n$ 
is infinite, 

(ii) the group of algebras G(U) split by an unramified field is a cycle 
of order n if and only if U is a cyclic field of degree n, 

(iii) the factor group of norm classes associated with an abelian un- 
ramified extension of degree n is a cycle of order n. 

Consequently most of the standard theorems of local class field theory 
do not hold for fields $k$ supposing only that the residue fields $\bar{k}$ are quasi- 
algebraically closed. Thus we see that it is necessary to impose further 
conditions upon $k$ in order that the old theory can be re-established. 


THEOREM 2. If the residue class field $\bar{k}$ of the perfect field $k$ is quasi- 


¹³ Such fields are for example the fields whose group is an alternating group of 
more than 4 variables. 



THE STRUCTURE OF LOCAL CLASS FIELD THEORY. 83 


algebraically closed and if for each unramified normal extension $U_n$ of $k$ there 
exists at least one normal division algebra $D_n$ of degree $n$ over $k$ which is split 
by $U_n$ then all unramified extensions of $k$ are cyclic. 


Proof. The first assumption implies that the group of algebras split by 
$U_n$ is isomorphic with the factor group $\Gamma(U_n,k)/\Gamma(U_n,k)'$. The other 
assumption $D_n \times U_n \sim U_n$ implies that $G(U_n)$ contains a proper division algebra of 
degree $n$. Hence $G(U_n)$ is a cyclic group of order $n$. Moreover, the structural 
theory of $D_n$ yields that $U_n$ must be a cyclic field. Thus we observe that all 
normal unramified fields $U_n$ are cyclic, consequently all unramified fields are 
cyclic and there exists exactly one cyclic unramified extension $Z_n$ of degree $n$ 
over $k$. Or, the residue class field $\bar{k}$ possesses exactly one cyclic extension $\bar{Z}_n$ 
for each degree $n$. 

Now it is very easy to prove that the local class field theory is valid in 
full extent for $k$ if we assume furthermore that Theorem 3 holds also for all 
finite algebraic extensions $K$ of $k$. According to the theory of C. Chevalley¹⁴ 
we can confine ourselves to show that each field $K_n$ of degree $n$ over $k$ is 
splitting field of the cyclic group of all algebras of degree $n$, for all unramified 
fields are cyclic. Let $D = (\pi, Z_n, \sigma_n)$ be a generator of the group $G(Z_n)$. 
Then $D \times K_n \sim K_n$ for 


$$D \times K_n \sim (\pi, Z_nK_n/K_n, \sigma_e) \sim (\Pi, Z_nK_n/K_n, \sigma_e)^{e} \sim 1,$$


where $\Pi$ denotes a prime element of $K_n$ and $e$, $f$ denote the ramification and 
residue degrees of $K_n$ respectively. Remark that $Z_n \cap K_n = Z_f$ is the inertial 
field of $K_n$ as in the classical theory. 


THEOREM 2'. If each extension $K_n$ of degree $n$ over $k$ is splitting field 
of all algebras of degree $n$ then there exists for each integer $n$ exactly one 
cyclic unramified extension $Z_n$ of degree $n$ over $k$. 


Proof. For each integer $n$ there exists at least one totally ramified field 
$K_n$, $\mathfrak{p} = \mathfrak{P}^{n}$ in $K_n$. Our assumption yields that $K_n$ is splitting field of a 
division algebra $D_n$, hence the ramification degree $e$ of $D_n$ is equal to $n$. 
Consequently we can apply Theorem 2 together with Lemma 3 and see that the 
unramified extensions $U_n$ of $k$ are all cyclic. 


THEOREM 3. All unramified extensions of k are cyclic if the groups of 


¹⁴ C. Chevalley, "La théorie du symbole de restes normiques," Crelle, vol. 169 (1932). 
¹⁵ Cf. Theorem 0 in the paper of Chevalley. 




algebras $G(U_n)$ which belong to the unramified extensions $U_n$ of degree $n$ over 
$k$ are cyclic groups of order $n$ and if this property holds also for the respective 
groups over the finite algebraic extensions of the groundfield $k$. 


Proof. According to the Galois theory it suffices to show that there exists 
for each degree $n$ exactly one field $U_n$. Suppose then that $U_n'$ and $U_n''$ are 
two distinct unramified normal fields of degree $n$; then the relative degree $\bar{n}$ 
of their join $U_n'U_n''$ is greater than $n$. Hence $G(U_n'U_n'')$ is a cyclic group 
of order $\bar{n}$ containing the two cyclic groups $G(U_n')$ and $G(U_n'')$ both of order 
$n$. Consequently $G(U_n')$ and $G(U_n'')$ coincide. Now let $D$ be an arbitrary 
normal division algebra of degree $n$ lying in $G(U_n') = G(U_n'')$. The algebra 
$D$ possesses $U_n'$ and $U_n''$ as maximally commutative subfields. Our assumption 
implies in particular that there exist cyclic unramified extensions of degree $n$ 
over $k$, namely otherwise we would arrive at a contradiction to Lemma 1. 
Consequently $U_n'$ and $U_n''$ must be equal to a cyclic field if they are considered 
in the same algebraic closure of $k$. 

In the preceding theorems we imposed conditions on the finite algebraic 
extensions of $k$ as well as on the normal division algebras over $k$. We now 
wish to investigate the properties of perfect fields $k$ for which only certain 
postulates concerning the normal division algebras are assumed; the conditions 
imposed upon the field $k$ will be of purely commutative type. 


THEOREM 4. If all classes of normal algebras which are representable by 
division algebras of degree $n$ form a group of order $n$ and if for each prime $q$ 
there exists at least one cyclic unramified extension $Z_q$ of degree $q$ over $k$ then 
the cyclic unramified extensions of degree $n$ are unique. 

Proof. It suffices to prove that there exists at least one cyclic unramified 
extension $Z_n$ over $k$ for each degree $n$. Namely, if $Z_n$ is such an unramified 
field then for 

$$[G(Z_n):1] = [k^*/NZ_n^*:1] = [\{\pi\}/\{\pi\}^{n}:1]\,[\epsilon/NE:1] = n\,[\bar{k}^*/N\bar{Z}_n^*:1],$$

and since $G(Z_n)$ is contained in the group of all algebras of degree $n$ which is 
supposed to have the order $n$, we see that 
$G(Z_n)$ is a cyclic group of order $n$ which coincides 
with the group of all algebras of degree $n$. 


This statement is true for all cyclic unramified fields of degree $n$; moreover 
each generating algebra of the group of algebras of degree $n$ is totally 




ramified and possesses all cyclic unramified fields of degree n as maximally 
commutative subfields. Hence these cyclic fields coincide in one and the same 
algebraic closure of k. 

According to the Galois theory it is sufficient to show that there exist 
unramified cyclic fields $Z_{q^\nu}$ whose degrees are powers of a single prime $q$. 

We distinguish two cases 

(i) $q = \chi(\bar{k})$ and (ii) $q \neq \chi(\bar{k})$. 


Case i. Since there exists at least one unramified cyclic field $Z_q$ there 
exists at least one cyclic extension $\bar{Z}_q$ of degree $q$ over the residue class field $\bar{k}$. 
According to the theory of cyclic extensions of degree $q^\nu$ over fields of 
characteristic $q$ there consequently must exist such extensions $\bar{Z}_{q^\nu}$.¹⁶ The 
corresponding perfect fields $Z_{q^\nu}$ are unramified and cyclic possessing the fields 
$\bar{Z}_{q^\nu}$ as residue class fields.¹⁷ 


Case ii. Again it is sufficient to consider the cyclic extensions of the 
residue class field $\bar{k}$. The field $\bar{k}$ contains a certain maximal $q^N$-th root of 
unity, where $N$ is either a finite integer (then the $q^{N+1}$-st roots of unity do not 
lie in $\bar{k}$) or $N$ is put equal to $\infty$ in a formal sense (then $\bar{k}$ contains all $q^N$-th 
roots of unity for arbitrarily great $N$). In the first case the $q^{N+\nu}$-th roots of 
unity determine cyclic extensions of degree $q^\nu$, for all such cyclotomic fields 
have a cyclic Galois group. In the second case our assumption yields too 
the existence of cyclic fields $\bar{Z}_{q^\nu}$ for any $\nu$. Namely, it implies that the index 
$[\bar{k}^*:\bar{k}^{*q}]$ is divisible by $q$. Let $a$ be a representative of $\bar{k}^*/\bar{k}^{*q}$ which is 
not equal to a $q$-th power, then the radicals $a^{1/q^\nu}$ generate cyclic fields 
$\bar{Z}_{q^\nu} = \bar{k}(a^{1/q^\nu})$ of degree $q^\nu$. 
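
As an illustration of the cyclotomic case, here is a worked instance with $q = 3$ and $N = 1$. The base field $F = \mathbb{Q}(\zeta_3)$ is our own choice of example and is not the field $\bar{k}$ of the text; $\zeta_m$ denotes a primitive $m$-th root of unity. The sketch covers odd $q$ only.

```latex
\begin{align*}
\Gamma\bigl(\mathbb{Q}(\zeta_{3^{1+\nu}})/\mathbb{Q}\bigr)
  &\cong (\mathbb{Z}/3^{1+\nu}\mathbb{Z})^{*},
  \quad \text{cyclic of order } 2\cdot 3^{\nu}, \\
\Gamma\bigl(F(\zeta_{3^{1+\nu}})/F\bigr)
  &\cong \{\, a \bmod 3^{1+\nu} : a \equiv 1 \pmod{3} \,\},
  \quad \text{of order } 3^{\nu}, \\
  &= \langle 4 \rangle,
  \quad \text{since } 4 = 1 + 3 \text{ has order } 3^{\nu} \text{ modulo } 3^{1+\nu}.
\end{align*}
```

For $\nu = 1$ one checks directly that $4^2 \equiv 7$ and $4^3 \equiv 1 \pmod 9$, so $F(\zeta_9)$ is cyclic of degree 3 over $F$.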


Remark. The result of Theorem 4 can also be obtained if we substitute 
for the first assumption the following: 

all classes of normal algebras over $k$ possessing the 
exponent $n$ form a set of $n$ elements. 

First we observe that all these classes form a group, for if $D_1$ and $D_2$ are 
any two elements in the set, i.e. $D_1^{n} \sim D_2^{n} \sim k$, then 

$$(D_1 \times D_2)^{n} \sim D_1^{n} \times D_2^{n} \sim k \quad\text{and}\quad (D_1^{n})^{-1} \sim k.$$


¹⁶ E. Witt, "Zyklische Körper und Algebren der Characteristik p vom Grade p^n," 
Crelle, vol. 176 (1937). 

¹⁷ H. Hasse, "Die Gruppe der p^n-primären Zahlen für einen Primteiler 𝔭 von p," 
Crelle, vol. 176 (1937). 




Furthermore this group of algebras is cyclic; in order to prove this assertion 
it is sufficient to show that all groups of algebras possessing exponents $q^i$ where 
$q$ is a prime are cyclic. But this is obvious since the group having exponent $q^i$ 
contains the group having exponent $q^{i-1}$ and since the factor group has the 
order $q$ according to our assumption. Now we may reason as before; the 
group of algebras of exponent $q^i$ contains in particular the ramified division 
algebras $(\pi, Z_{q^i})$ belonging to the cyclic unramified fields $Z_{q^i}$. 

We wish to observe that the unique existence of the cyclic unramified 
fields $Z_n$ for each degree $n$ over $k$ does not imply that these fields are the only 
existing unramified extensions over $k$. Namely consider the following example. 
Let $k_0$ be the field of all roots of unity over the field of all rational numbers; 
let $a' \neq 0$ be an arbitrary element of $k_0$, then there exists a maximal integer 
$N(a')$ such that $a'^{\,1/N(a')}$ lies in $k_0$ but not $a'^{\,1/\nu N(a')}$ for $\nu > 1$. Put 
$a = a'^{\,1/N(a')}$. Next adjoin to $k_0$ all solutions of all solvable equations whose 
normal fields do not possess the fields $k_0(a^{1/n})$, $n > 1$, as subfields. Let the 
resulting enumerable algebraic extension of $k_0$ be $k_1$. The field $k_1$ is also quasi- 
algebraically closed, moreover $k_1(a^{1/n})$ are cyclic extensions of degree $n$ over $k_1$. 
Adjoin to $k_1$ again all solutions of all solvable equations in $k_1$ which do not 
contain $a^{1/n}$, etc. The field $\bar{k}$ which we obtain finally is quasi-algebraically closed 
and it does not contain $a^{1/n}$, moreover there do exist non-solvable extensions of 
sufficiently high degrees over $\bar{k}$. The perfect field $k$ be now the field of all 
formal power series in one variable $t$ with coefficients in $\bar{k}$. Evidently all 
normal division algebras of degree $n$ over $k$ are powers of the algebra 
$(t, \bar{k}(a^{1/n})\{t\}/k)$. Hence they form a cyclic group of classes of algebras having 
order $n$. The assumptions of Theorem 4 are fulfilled by the field $k$, but there 
exist other unramified extensions $U_n$ over $k$ which are not cyclic. Thus we see 
that the statement of Theorem 4 is the best possible. 

In the following investigations we shall be concerned with the implications 
for the perfect field $k$ if we assume certain properties to hold for the factor 
group of norm classes. 


LEMMA 4. If the class group $k^*/NZ_n^*$ is cyclic of order $n$ for all 
unramified cyclic extensions of degree $n$ and if the same is true for all unramified 
extensions $U$ of $k$ then the field of residue classes $\bar{k}$ is quasi-algebraically closed. 


Proof. The assumption $k^*/NZ_n^* = Z(n)$ yields that 
$$\bar{k}^*/N\bar{Z}_n^* = 1,$$
for 
$$[k^*/NZ_n^*:1] = [\{\pi\}/\{\pi\}^{n}:1]\,[\epsilon/NE:1] = n\,[\bar{k}^*/N\bar{Z}_n^*:1].$$




Moreover, we observe that the field of residue classes $\bar{k}$ is algebraically 
perfect. For otherwise there would exist proper cyclic and abelian division 
algebras $\bar{D}$ over $\bar{k}$ whose degrees are powers of the characteristic $\chi(\bar{k})$.¹⁸ 
There would exist at least one cyclic unramified field $Z_q$ over $k$ whose field of 
residue classes is equal to a cyclic field $\bar{Z}_q$ which is splitting field of a normal 
division algebra over $\bar{k}$.¹⁹ The norm class group $k^*/NZ_q^*$ has then an order 
which is greater than $q$ in contradiction to $k^*/NZ_q^* = Z(q)$. 

Next let $\bar{D}$ be an arbitrary normal division algebra over $\bar{k}$; we must show 
that $\bar{D} \sim \bar{k}$. The algebra $\bar{D}$ possesses at least one normal splitting field $\bar{U}$: 
$\bar{D} \sim (a(\sigma,\tau), \bar{U}, \Gamma(\bar{U},\bar{k}))$. Let $q$ be a prime divisor of the degree $m$ of $\bar{D}$ 
which is supposed to be greater than 1. Let $\mathfrak{S}_q$ be a Sylow subgroup of $\Gamma(\bar{U},\bar{k})$ 
belonging to the prime $q$, and let $\bar{U}_q$ be the corresponding subfield of $\bar{U}$, then 
$[\bar{U}_q:\bar{k}] \not\equiv 0 \pmod{q}$. Since $\mathfrak{S}_q$ is solvable there exists at least one chain of 
fields between $\bar{U}_q$ and $\bar{U}$ 


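$$\bar{U}_q = \bar{U}_q^{(0)} < \bar{U}_q^{(1)} < \cdots < \bar{U}_q^{(s)} = \bar{U}$$

such that $\bar{U}_q^{(i)}$ are cyclic extensions of degree $q$ over $\bar{U}_q^{(i-1)}$, $i = 1, \dots, s$. 
Consider now the algebra $\bar{D} \times \bar{U}_q^{(s-1)}$; it will be similar to $\bar{U}_q^{(s-1)}$ as the argument 
used in the preceding Lemma 2 and the relation $\bar{k}^*/N\bar{Z}_q^* = 1$, which is 
supposed to hold also for the finite extensions of $\bar{k}$, show. Again $\bar{D} \times \bar{U}_q \sim \bar{U}_q$ 
leads to a contradiction. Hence $\bar{k}$ is quasi-algebraically closed, moreover it is 
algebraically perfect for fields $\bar{k}$ which are not algebraically perfect are centers 
of normal division algebras. 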


THEOREM 5. If $k^*/NA^* \cong \Gamma(A,k)$ holds for all unramified abelian 
extensions $A$ of $k$ then there exists for each degree $n$ a uniquely determined cyclic 
field of degree $n$. 


Proof. Let $A$ be an arbitrary unramified abelian field of degree $n$ over $k$, 
then 
$$[k^*/NA^*:1] = [\{\pi\}/\{\pi\}^{n}:1]\,[\epsilon/NE:1].$$
Hence 
$$\Gamma(A,k) = Z(n) \quad\text{and}\quad \epsilon/NE = \bar{k}^*/N\bar{A}^* = 1,$$

i.e., all unramified abelian fields are cyclic. It is then obvious that there exists 
for each degree $n$ exactly one cyclic unramified extension $Z_n$ of $k$. 


¹⁸ A. A. Albert, "Normal division algebras over a modular field," Transactions of 
the American Mathematical Society, vol. 36 (1934); G. Köthe, "Über Schiefkörper mit 
Unterkörpern zweiter Art über dem Zentrum," Crelle, vol. 166 (1932). 

¹⁹ T. Nakayama, "Über die Algebren über einem Körper von der 
Primzahlcharakteristik, II," Proceedings of the Imperial Academy of Tokyo, vol. 12 (1937). 




THEOREM 6. If $k^*/NU^* \cong \Gamma(A,k)$ holds for all normal unramified 
extensions $U$ over $k$ where $A$ denotes the maximal abelian subfield of $U$ then 
there exist only cyclic unramified extensions $Z_n$ over $k$. 


Proof. Let $U_n$ be an arbitrary unramified normal extension of $k$; then 

$$[k^*/NU_n^*:1] = [\{\pi\}/\{\pi\}^{n}:1]\,[\epsilon/NE:1] = n,$$
for 
$$[k^*/NU_n^*:1] = [k^*/NA^*:1] \leq n.$$
Consequently 
$$\Gamma(A,k) = Z(n),$$
hence 
$$\Gamma(U_n,k) = Z(n)$$

for $\Gamma(A,k)$ is a homomorphic map of $\Gamma(U_n,k)$. 

Again as in Lemma 4 we see that $\bar{k}$ is algebraically perfect. Moreover, 
the field $\bar{k}$ is quasi-algebraically closed. Since $\bar{k}$ is algebraically perfect there 
do not exist normal division algebras of degree $\chi(\bar{k})^{\nu}$ over $\bar{k}$. Let then $\bar{D}$ be a 
normal division algebra over $\bar{k}$ whose degree is relatively prime to $\chi(\bar{k})$. 
Such an algebra is cyclic for $\bar{k}$ possesses only cyclic extensions $\bar{Z}_n$. The 
algebra $\bar{D}$ is the algebra of residue classes of a normal division algebra $D$ 
over $k$ belonging to $G(Z_n)$ where $Z_n$ is the cyclic unramified extension of $k$ 
corresponding to $\bar{Z}_n$. Since $G(Z_n)$ is a cyclic group of order $n$ containing only 
ramified algebras the assumption $\bar{D} \not\sim \bar{k}$ leads to a contradiction. 


LEMMA 5. If $[k^*/NZ_n^*:1] = n$ holds for all cyclic extensions of degree 
$n$ over $k$ then there exist cyclic unramified extensions of degree $n$ with 
respect to $k$. 
Proof. We distinguish two cases 

(i) $(n, \chi(\bar{k})) = 1$ and (ii) $n = \chi(\bar{k})^{\nu}$. 

It is obviously sufficient to prove the assertion of the lemma for degrees 
$n$ which are powers of a single prime $q$. 


Case i. Suppose that there do not exist cyclic unramified extensions $Z_{q^\nu}$ 
of degree $q^\nu$, then there do not exist cyclic extensions $\bar{Z}_{q^\nu}$ of degree $q^\nu$ over the 
field of residue classes $\bar{k}$. Hence the fields $\bar{k}$ and $k$ respectively must contain 
all $q^\nu$-th roots of unity. Moreover, $[\bar{k}^*:\bar{k}^{*q}] = [\bar{k}^*:\bar{k}^{*q^\nu}] = 1$. For if 
$[\bar{k}^*:\bar{k}^{*q}]$ were divisible by $q$ then any representative $a$ of $\bar{k}^*/\bar{k}^{*q}$ which is not 
equivalent to 1 mod $\bar{k}^{*q}$ would generate cyclic extensions $\bar{Z}_{q^\nu} = \bar{k}(a^{1/q^\nu})$ of 



degree $q^\nu$ over $\bar{k}$. Since $[k^*/NZ_n^*:1] = n$ for all unramified fields $Z_n$ 
implies that $\bar{k}$ is algebraically perfect there is a one to one correspondence 
between the cyclic unramified extensions $Z_{q^\nu}$ over $k$ and the cyclic extensions 
over $\bar{k}$; the field $\bar{Z}_{q^\nu} = \bar{k}(a^{1/q^\nu})$ must be equal to $\bar{k}$. 

Next $[\bar{k}^*:\bar{k}^{*q^\nu}] = 1$ implies that all elements of $\bar{k}$ are $q^\nu$-th powers. 
Consequently all units $\epsilon$ of $k$ are $q^\nu$-th powers as a simple $\mathfrak{p}$-adic approximation 
yields. 

Now consider the cyclic ramified field $K_{q^\nu} = k(\pi^{1/q^\nu}) = k((\epsilon\pi)^{1/q^\nu})$. The 
group of all norms of elements in $K_{q^\nu}$ contains the group of all units $\epsilon$ of $k$ 
for they are all $q^\nu$-th powers as we have just observed. Moreover $NK^*_{q^\nu}$ 
contains the prime element $\pi$, namely if $q \neq 2$ then $\pi = N(\pi^{1/q^\nu})$ and if $q = 2$ 
then $-1$ is a square or $2^\nu$-th power in $k$, consequently $\pi = -1 \cdot -\pi$ is a 
norm. Hence $NK^*_{q^\nu} = k^*$ in contradiction to the assumption of the Lemma. 
Thus we find that $k$ must possess cyclic unramified extensions $Z_{q^\nu}$; they will 
be found among the cyclotomic extensions if $\zeta_{q^N}$ ($N$ finite) is the maximal 
root of unity lying in $k$ or they will be radical fields if all $q^\nu$-th roots of unity 
lie in $k$. 
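
The norm computation used for the prime element can be written out as follows. This is a sketch in our own notation: $\Pi = \pi^{1/q^\nu}$ is assumed to be a root of $x^{q^\nu} - \pi$, which is irreducible over $k$ by the Eisenstein criterion, and $N$ is the norm from $k(\Pi)$ to $k$.

```latex
\begin{align*}
N(\Pi) &= (-1)^{q^\nu}\cdot(-\pi) = (-1)^{q^\nu+1}\,\pi, \\
q \neq 2 &:\quad N(\Pi) = \pi, \\
q = 2 &:\quad N(\Pi) = -\pi, \qquad
  -1 = \eta^{2^\nu} = N(\eta) \ \text{for some unit } \eta \text{ of } k, \\
 &\phantom{:}\quad \pi = (-1)\cdot(-\pi) = N(\eta\,\Pi).
\end{align*}
```

In both cases $\pi$ is a norm, and since every unit of $k$ is a $q^\nu$-th power and hence a norm, the norm group is all of $k^*$.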


Case ii. The assumption yields that there must exist proper division 
algebras $D$ of degree $q^\nu$ over $k$, namely among possible other algebras the 
generators $D_{q^\nu}$ of the cyclic groups $G(Z'_{q^\nu})$ where $Z'_{q^\nu}$ denotes a ramified 
cyclic extension of degree $q^\nu$ over $k$. The existence of such fields has been 
established in the general theory of cyclic extensions over perfect fields whose 
residue class fields $\bar{k}$ are algebraically perfect.²⁰ 

Now let $D_{q^\nu}$ be an arbitrary division algebra of degree $q^\nu$ over $k$; it 
possesses at least one unramified normal splitting field $U$ for $\bar{k}$ is algebraically 
perfect: 
$$D_{q^\nu} \sim (a(\sigma,\tau), U, \Gamma(U,k)) \sim (\pi^{e(\sigma,\tau)}, U, \Gamma(U,k)) \times (\eta(\sigma,\tau), U, \Gamma(U,k)).$$

The algebra $(\eta(\sigma,\tau), U, \Gamma(U,k))$ which could be similar to an unramified 
algebra of degree $q^\mu$ ($\mu \leq \nu$) must be similar to $k$ for its residue algebra 
$(\eta(\sigma,\tau) \bmod \mathfrak{p}, \bar{U}, \Gamma(\bar{U},\bar{k}))$ is similar to $\bar{k}$, $\bar{k}$ being an algebraically perfect 
field.²¹ Hence 

$$D_{q^\nu} \sim (\pi^{e(\sigma,\tau)}, U, \Gamma(U,k)) \sim (\pi, Z_e),$$

where $Z_e$ denotes the unramified cyclic subfield of $U$ which corresponds to the 
character of $\Gamma(U,k)$ induced by the set of addends $e(\sigma,\tau)$. 


²⁰ Cf. the two papers mentioned in footnotes 16 and 17. 
²¹ E. Witt, "Schiefkörper über diskret bewerteten Körpern," Crelle, vol. 176 (1937). 




We have $\mu = \nu$ for $D_{q^\nu}$ was supposed to be a normal division algebra of 
degree $q^\nu$, thus the existence of cyclic unramified fields of degree $q^\nu$ is 
established. Moreover we observe that all algebras $D_{q^\nu}$ are cyclic and ramified. 


THEOREM 7. If $[k^*/NZ_n^*:1] = n$ holds for all cyclic extensions $Z_n$ 
over $k$ and similarly for all cyclic extensions over the finite algebraic extensions 
of $k$ then 

(i) all cyclic unramified fields of degree $n$ over $k$ coincide, 
(ii) the field of residue classes $\bar{k}$ is algebraically perfect, and 
(iii) $k^*/NA^* \cong \Gamma(A,k)$ holds for all abelian extensions $A$ over $k$. 


Proof. First we observe that Lemma 5 implies the existence of cyclic 
unramified fields $Z_q$ of prime degree $q$. If $q \neq \chi(\bar{k})$ then there exist cyclic 
unramified extensions $Z_{q^\nu}$ for arbitrary $\nu$. Namely, either cyclotomic subfields 
of $k(\zeta_{q^{N+\nu}})$ (if $\zeta_{q^N}$ is the maximal $q^N$-th root of unity lying in $k$) represent 
such fields or radical extensions $k(a^{1/q^\nu})$ where $Z_q = k(a^{1/q})$ is one of the cyclic 
unramified extensions which must exist according to Lemma 5. If $q = \chi(\bar{k})$ 
then cyclic unramified extensions must exist according to the theory of 
$q$-algebras if the latter is applied to perfect fields for which our assumption 
holds. 

Lemma 5 implies furthermore that $\bar{k}$ is algebraically perfect. Hence 
there can be established a one to one correspondence between the unramified 
division algebras over $k$ and the division algebras over $\bar{k}$. 

In order to prove that all cyclic unramified fields of degree $n$ coincide 
it is sufficient to prove that all cyclic unramified fields of prime degree $q$ 
coincide for each $q$. Let then $Z_q$ and $Z'_q$ be two cyclic unramified fields of 
degree $q$ over $k$. Their respective groups of algebras $G(Z_q)$ and $G(Z'_q)$ have 
the generators $(\pi, Z_q)$ and $(\pi, Z'_q)$ respectively. Consequently (if $q \neq \chi(\bar{k})$) 
the separable field $k(\pi^{1/q})$ is a common splitting field of $G(Z_q)$ and $G(Z'_q)$. 
Now assume that $k$ contains the $q$-th roots of unity; then $k(\pi^{1/q})$ is a cyclic 
extension of degree $q$ over $k$. Our assumption implies 

$$G(Z_q) = G(Z'_q) = G(k(\pi^{1/q})) = Z(q).$$
Since each proper division algebra $D_q$ in $G(k(\pi^{1/q}))$ is ramified and since 
$Z_q$ and $Z'_q$ are both unramified cyclic splitting fields of $D_q$ we get for the 
algebra of residue classes 
$$D_q/\mathfrak{P} \cong \bar{Z}_q \cong \bar{Z}'_q, \quad\text{or}\quad \bar{Z}_q = \bar{Z}'_q \text{ over } \bar{k},$$

hence $Z_q$ and $Z'_q$ are abstractly isomorphic which amounts exactly to our 
assertion. 




If $k$ does not contain the $q$-th roots of unity then consider the extensions 
of $G(Z_q)$ and $G(Z'_q)$ by the cyclotomic field $k(\zeta_q)$. Since $[k(\zeta_q):k]$ is 
relatively prime to $q$ the general algebraic theory of splitting fields yields that 
the algebras in $G(Z_q)$ and $G(Z'_q)$ do not become similar to $k(\zeta_q)$ if the center 
is extended to $k(\zeta_q)$. The field $k(\pi^{1/q})k(\zeta_q)$ is a cyclic extension of degree $q$ 
over the new ground field $k(\zeta_q)$. As in the previous case our assumption yields 
that the extended groups coincide. Hence the fields $Z_qk(\zeta_q)$ and $Z'_qk(\zeta_q)$ 
which are cyclic of degree $q(q-1)$ over $k$ coincide, consequently according 
to the Galois theory $Z_q = Z'_q$. 

A treatment of the case $q = \chi(\bar{k}) = \chi(k)$ can be found in the literature.²² 

Next we show that the field of residue classes $\bar{k}$ is quasi-algebraically 
closed. Our assumption implies in particular that $k^*/NZ_n^* = Z(n)$ for all 
cyclic unramified extensions of degree $n$. Consequently there do not exist 
cyclic algebras over $\bar{k}$ for such algebras would be residue algebras of unramified 
cyclic algebras over $k$. Furthermore our general assumption implies the same 
for all finite algebraic extensions over $k$. Hence we can apply Lemma 2 and 
we see that $\bar{k}$ is quasi-algebraically closed. 

An immediate consequence according to Theorem 1 and the algebraic 
theory is the fact that all normal division algebras of degree $n$ over $k$ are 
ramified and that they possess cyclic unramified splitting fields $Z_n$; moreover 
degree and exponent of all division algebras coincide. Since the fields $Z_n$ are 
uniquely determined by their degrees $n$ we see that all algebras which are 
representable by division algebras of degree $n$ form a cyclic group of order $n$. 

Next we prove that all abelian extensions $A_n$ of degree $n$ over $k$ are splitting 
fields of the class group of all algebras of degree $n$ over $k$. Let $Z_f$ be the 
inertial field of a fixed abelian field $A_n$, then $\mathfrak{p} = \mathfrak{P}^{n/f}$ in $A_n$ and $\pi = E\Pi^{n/f}$ 
with a suitable unit $E$ of $A_n$. Consider now a generating algebra $(\pi, Z_n, \sigma_n)$ 
of $G(Z_n)$ and form the direct product $(\pi, Z_n, \sigma_n) \times A_n$; then 

$$(\pi, Z_n, \sigma_n) \times A_n \sim (\pi, Z_nA_n/A_n, \sigma_{n/f}) \sim (E, Z_nA_n/A_n, \sigma_{n/f}) \times (\Pi, Z_nA_n/A_n, \sigma_{n/f})^{n/f} \sim A_n$$

for our assumptions imply that $A_n^*/N(Z_nA_n)^* = Z(n/f)$ and $Z_nA_n$ is the 
unramified cyclic extension of degree $n/f$ over $A_n$. Hence $A_n$ is a splitting 
field of the uniquely determined class group of algebras $G(Z_n)$. 
Now we prove that $[k^*/NS^*:1] = [S:k]$ for any solvable extension 
$S$ of $k$. Since $S$ is solvable it possesses a chain of subfields 

$$k = A_0 < A_1 < \cdots < A_s = S$$


²² See footnote 20; consider the algebras $(a, \pi]$ which have the same unramified 
splitting field. 




such that always $A_i$ is a cyclic extension over $A_{i-1}$. The cyclicity of $G(Z_n)$ 
yields that the norm factor group for any cyclic extension over $k$ is cyclic; 
according to our general assumption the same is true for all finite algebraic 
extensions $K$ over $k$. Thus $[A^*_{i-1}:NA^*_i] = [A_i:A_{i-1}]$. Using the 
transitive property of the norm and induction by $i$ one readily observes that 
$[k^*/NS^*:1] = [S:k]$.²³ 
Now let $A_n$ be an abelian field over $k$; there always exists a division 
algebra $D_n = (a(\sigma,\tau), A_n, \Gamma(A_n,k))$ in $G(Z_n)$ as we have seen before. Hence 
the function $f(\sigma) = \prod_{(\tau)} a(\sigma,\tau)$ determines an isomorphism between $\Gamma(A_n,k)$ 
and a subgroup of $k^*/NA_n^*$, consequently 

$$k^*/NA_n^* \cong \Gamma(A_n,k)$$
as asserted.²⁴ 


Remark. Our assumptions do not exclude the existence of non-solvable 
equations in $k$ as the example on page 86 shows. 

The uniqueness of the cyclic unramified extensions in Theorem 6 was a 
consequence of the postulated isomorphism $k^*/NA^* \cong \Gamma(A,k)$. This isomorphism 
amounts to the fact that the norm factor group of a cyclic unramified extension 
over another cyclic unramified extension over the ground field $k$ is a cyclic 
group whose order is equal to the respective relative degree. The same result 
as in Theorem 6 can be obtained if we postulate for $k$ 

(i) $k^*/NZ_n^* = Z(n)$ for all cyclic unramified fields, 
(ii) if $Z_n$ and $Z_m$ are two cyclic fields then 

$$NZ_n^* \cap NZ_m^* = N(Z_nZ_m)^*, \quad\text{or}$$
(ii') $$\{NZ_n^*, NZ_m^*\} = N(Z_n \cap Z_m)^*.$$


Obviously it suffices to prove the uniqueness for cyclic unramified fields $Z_{q^\mu}$ 
and $Z_{q^\nu}$ whose degrees are powers of one and the same prime $q$. Assume that 
$Z_{q^\mu}$ and $Z_{q^\nu}$ are distinct. Then 


²³ For a model of the proof see F. K. Schmidt, "Zur Klassenkörpertheorie im 
Kleinen," Crelle, vol. 162 (1930). 

²⁴ T. Nakayama, "Über die Beziehungen zwischen den Faktorensystemen und der 
Normenklassengruppe eines galoisschen Erweiterungskörpers," Mathematische Annalen, 
vol. 112 (1935). 




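Since both fields are unramified the group of units $\epsilon$ in $k$ is contained in $NZ^*_{q^\mu}$ 
as well as in $NZ^*_{q^\nu}$; assumptions (ii) and (ii') together with (i) yield 

$$[k^*/N(Z_{q^\mu}Z_{q^\nu})^*:1] = q^{M} \quad\text{and}\quad [k^*/N(Z_{q^\mu} \cap Z_{q^\nu})^*:1] = q^{m},$$

$M = \max(\mu,\nu)$ and $m = \min(\mu,\nu)$. But surely $[k^*/N(Z_{q^\mu}Z_{q^\nu})^*:1] > q^{M}$ 
and $[k^*/N(Z_{q^\mu} \cap Z_{q^\nu})^*:1] < q^{m}$. Hence $Z_{q^m}$ must be a subfield of $Z_{q^M}$ 
whereby the uniqueness is established. 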


LEMMA 6. If for each subgroup $H$ of the multiplicative group $k^*$ 
belonging to the perfect field $k$ there exists a uniquely determined abelian field 
$K$ of finite degree over $k$ such that $NK^* = H(K) = H$ and if $H_1 < H_2$ 
implies $K_2 < K_1$ then 

$$[k^*/H:1] = [K:k].$$


Proof. Let $H$ be an arbitrary subgroup of $k^*$ and let $K$ be the associated 
abelian field over $k$ such that $H(K) = H$. The Galois group of $K$ with 
respect to $k$ be denoted by $\Gamma(K,k) = \Gamma$. Consider now an arbitrary chain of 
subfields between $K$ and $k$ 

$$k = K_0 < K_1 < \cdots < K_i < \cdots < K_s = K$$

such that $K_i$ is a cyclic extension of $K_{i-1}$ whose degree is a prime, $i = 1, 2, \dots, s$. 
The associated norm groups $N_iK_i^* = H(K_i) = H_i$ form a descending chain 

$$k^* = H_0 \geq H_1 \geq \cdots \geq H_s = H$$

as a simple consideration shows. According to our assumption there holds a 
one to one correspondence between the subgroups $H_i$ and the subfields $K_i$, $K_i$ 
being the uniquely determined abelian extension of $k$ belonging to $H_i = N_iK_i^*$. 

Always $H_{i-1} > H_i$, for if $H_{i-1} = H_i$ then also $K_{i-1} = K_i$ according to 
the second part of our assumption. Moreover, the factor groups $H_{i-1}/H_i$ are 
cyclic groups of prime order for the existence of a proper subgroup $H'$ of $H_{i-1}$ 
which is different from $H_i$ would imply the existence of a field $K'$ such that 
$K_{i-1} < K' < K_i$. 

Hence $[k^*/H:1] = \prod_{i=1}^{s} [H_{i-1}/H_i:1]$ is finite. 


Now let $H$ be a subgroup of $k^*$ such that $[k^*/H:1] = q^{\rho}$ where $q$ is a 
rational prime. We draw a composition series 

$$k^* = H_0 > H_1 > \cdots > H_\rho = H$$




between $k^*$ and $H$ such that the factor groups $H_{i-1}/H_i$ are cyclic groups of 
order $q$. The associated abelian extensions of $k$ be 

$$k = K_0 < K_1 < \cdots < K_\rho = K;$$

evidently $K_i$ is always a cyclic extension of prime degree $q_i$ over $K_{i-1}$, 
$i = 1, 2, \dots$. Next we show that $q_i = q$. Consider the field $K_1$ over $k$; then 
$k^* > H_1 > k^{*q_1}$, and since $[k^*:H_1] = q$ we get $q_1 = q$. Then induction implies 
$q_1 = \cdots = q_\rho = q$. Hence obviously 

$$[k^*/H:1] = [K:k].$$


Now let $H$ be a subgroup of index $n = \prod_{i=1}^{r} q_i^{\nu_i}$ under $k^*$. According to a 
group theoretical theorem we obtain $H = H^{(1)} \cap \cdots \cap H^{(r)}$ where $H^{(i)}$ are 
subgroups of index $q_i^{\nu_i}$ under $k^*$. The Galois theory combined with the 
assumptions of the Lemma yields that the field $K$ belonging to $H$ is the join 
of the fields $K^{(i)}$ belonging to the groups $H^{(i)}$. Hence $[k^*/H:1] = [K:k]$. 
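
The group theoretical step can be illustrated by a small case. The following sketch assumes, purely for illustration, that $k^*/H$ is cyclic of order 12 with generator represented by an element $g$ of $k^*$; in the text $k^*/H$ is only a finite abelian group. The braces denote the group generated by the listed elements and subgroups.

```latex
\begin{align*}
n &= 12 = 2^{2}\cdot 3, \\
H^{(1)} &= \{g^{4}, H\}, \qquad [k^*/H^{(1)}:1] = 4, \\
H^{(2)} &= \{g^{3}, H\}, \qquad [k^*/H^{(2)}:1] = 3, \\
H^{(1)} \cap H^{(2)} &= \{g^{12}, H\} = H.
\end{align*}
```

In this instance the fields belonging to $H^{(1)}$ and $H^{(2)}$ would have degrees 4 and 3, and their join has degree 12, in agreement with $[k^*/H:1] = [K:k]$.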


COROLLARY. For each finite abelian extension $K$ over $k$ holds 
$[k^*/H(K):1] = [K:k]$. 


Proof. We draw again a composition chain between $k^*$ and $H(K)$. The 
finiteness of the degree $[K:k]$ implies according to the assumptions of the 
lemma that the index $[k^*/H(K):1]$ is finite. The uniqueness of $K$ as the 
field belonging to $H(K)$ asserts together with Lemma 6 that 

$$[k^*/H(K):1] = [K:k].$$


THEOREM 8. Under the same assumptions as in Lemma 6 it follows that 
all abelian unramified fields $U_n$ are cyclic. 


Proof. Let $U_n$ be an arbitrary abelian unramified field of degree $n$ over $k$. 
Then $H(U_n) = (\{\pi\}^{n}, NE)$. Since $[k^*:H(U_n)] = n$ according to the 
corollary we must have $NE = \epsilon$. The last equality holds for all abelian unramified 
extensions $U_n$, hence their norm groups coincide and they must be cyclic for 
this conclusion holds in particular for all prime degrees $q$. 


THEOREM 9. If the abelian field $K$ over $k$ belonging to the group $H$ is 
contained in all extensions $L$ over $k$ whose norm groups $H(L)$ are subgroups 




of $H$ and if the assumptions of Lemma 6 hold for $k$ then all unramified finite 
extensions $U_n$ of $k$ are cyclic. 


Proof. Let $U_n$ be an unramified separable field of degree $n$ over $k$; then 
$H(U_n) = (\{\pi\}^{n}, NE)$. The group $NE$ is a subgroup of $\epsilon$, the group of all 
units in $k$, hence $H(U_n) \leq (\{\pi\}^{n}, \epsilon)$. Let the cyclic unramified extension 
belonging to $(\{\pi\}^{n}, \epsilon)$ according to Theorem 8 be $Z_n$. Then our assumption 
yields that $Z_n \leq U_n$. Consequently $Z_n = U_n$ for both fields have the same 
degree. 


Remark. A further consequence of the assumptions in Lemma 6 is the 
fact that the field of residue classes $\bar{k}$ is algebraically perfect. Namely 
Theorem 8 yields that $k^*/NZ_n^* = Z(n)$ holds for all unramified extensions, 
hence there do not exist proper division algebras of degree $\chi(\bar{k})^{\nu}$ over $\bar{k}$ as 
previous considerations show. 

The most important conclusion obtained in the previous theorems was the 
fact that for each integer $n$ there exists a uniquely determined cyclic unramified 
extension $Z_n$ of degree $n$ over $k$. In some theorems we had to assume that 
the field of residue classes be an algebraically perfect field, mostly when we 
made use of the algebraic theory of division algebras $D$ over $k$ in the proofs. 
However, in theorems of this type it suffices in general that the assumptions 
are postulated for all normal division algebras which possess residue division 
algebras whose centers are separable over $\bar{k}$. Our aim is to infer from the 
postulates that $\bar{k}$ is a Galois field. In order to achieve this it is necessary to 
impose further conditions on the perfect field $k$. Namely, there exist examples 
of perfect fields for which most of the theorems hold although the residue fields 
are not Galois fields. For example the fields of formal power series in two 
variables over an algebraically closed field of characteristic 0 where the residue 
field belonging to the only existing isolated subgroup of rank one contained in the 
valuation group of the field is considered as the residue field of the discrete 
archimedean valuation of the field.²⁵ 

The restriction to be imposed on $k$ should not contain a postulate with 
regard to the characteristic of $k$. Thus we are led to the following assumption: 
the postulates in the theorems asserting the uniqueness of the unramified 
cyclic extensions over $k$ shall also be true for all perfect subfields $k'$ of $k$ whose 
residue class fields $\bar{k}'$ are subfields of $\bar{k}$. 


²⁵ O. F. G. Schilling, "Arithmetic in fields of formal power series," Annals of Mathematics, vol. 38 (1937).


THEOREM 10. If a field k satisfying any one of the assumptions implying the uniqueness of the cyclic unramified extensions fulfills the postulate about its perfect subfields then k is a perfect field whose field of residue classes $\bar{k}$ is a Galois field whose G-number possesses no infinite part.²⁶


Proof. The field of residue classes $\bar{k}$ must have the property that all its subfields $\bar{k}'$ admit exactly one cyclic extension of degree n, hence in particular the prime field of characteristic $\chi(\bar{k})$ which lies in $\bar{k}$. Thus fields $\bar{k}$ of characteristic 0 are ruled out, for the field of all rational numbers possesses infinitely many cyclic extensions for each degree n. For the same reason it follows that $\bar{k}$ must be algebraic over its prime field if $\chi(\bar{k}) \neq 0$. Namely, if $\bar{k}$ contained a quantity t transcendental over the prime field then the field of all rational functions of t over the prime field would have to possess exactly one cyclic extension of degree n for each integer n, an implication which obviously is false.
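
The contrast used in this proof can be made concrete for the degree n = 2. The following is only a minimal sketch, and the function names are illustrative and not taken from the paper. For an odd prime p the nonzero residues mod p fall into exactly two classes modulo squares, so the prime field of characteristic p has a single quadratic extension, whereas over the rational numbers every squarefree integer d different from 1 yields a different quadratic field obtained by adjoining the square root of d.

```python
def square_classes_mod_p(p):
    """Number of classes of nonzero residues mod an odd prime p modulo squares."""
    nonzero = set(range(1, p))
    squares = {(x * x) % p for x in nonzero}
    # The squares form a subgroup; its index is the number of square classes.
    return len(nonzero) // len(squares)


def is_squarefree(d):
    """True if no square greater than 1 divides the nonzero integer d."""
    d = abs(d)
    f = 2
    while f * f <= d:
        if d % (f * f) == 0:
            return False
        f += 1
    return True


def quadratic_fields_up_to(bound):
    """Squarefree d != 1 with |d| <= bound; each d gives a different quadratic field."""
    return [d for d in range(-bound, bound + 1)
            if d not in (0, 1) and is_squarefree(d)]


if __name__ == "__main__":
    for p in (3, 5, 7, 11, 13):
        # Two square classes: the trivial one and one giving the quadratic extension.
        print(p, square_classes_mod_p(p))
    for bound in (10, 100, 1000):
        # The count over the rationals keeps growing with the bound.
        print(bound, len(quadratic_fields_up_to(bound)))
```

The count for the prime fields stays fixed while the count over the rationals grows without limit, which is the degree-two case of the statement that the rational field possesses infinitely many cyclic extensions of each degree.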

Now it remains to be proved that the infinite part of the G-number belonging to $\bar{k}$ is equal to one. Assume that q is a divisor of $G_{\mathrm{inf}}(\bar{k})$. Then $\bar{k}$ possesses no cyclic extensions of degree $q^{\nu}$ over $\bar{k}$, consequently the assumptions which lead to the uniqueness of the cyclic unramified extensions over k are violated for all degrees which are divisible by q. An extensive investigation leading up to the preceding assertion has already been made.²⁷ Hence $G_{\mathrm{inf}}(\bar{k}) = 1$.

We see that the residue fields $\bar{k}$ which result are algebraically perfect. Therefore the structure of the perfect fields k is determined by the fields $\bar{k}$.²⁸ In a certain sense they are the p-adic fields belonging to algebraic number fields and fields of functions of one variable whose fields of coefficients are Galois fields; these are exactly the fields for which the local class field theory has first been developed. We wish to mention that the perfect fields which can be characterized by topological properties belong to the class of perfect fields we studied here.²⁹ Thus we have studied an algebraico-arithmetical counterpart of N. Jacobson's theory.

It was quite material in our investigations that the value group B(k) belonging to the perfect field was isomorphic with the additive group of all rational integers. It implies in particular that the prime ideal $\mathfrak{p}$ of k can be generated by a prime element π. Consequently we could speak of a ramification degree e belonging to a finite algebraic extension of k or to a normal division algebra over k. However, one can find a substitute for this definition of ramification. Namely, if e > 1 then $[\mathfrak{O}/\mathfrak{P} : \mathfrak{o}/\mathfrak{p}] = n/e < n$ in the case that B(k) is isomorphic to the additive group of all integers; if B(k) is not discrete then we shall say that K is ramified over k if $[\mathfrak{O}/\mathfrak{P} : \mathfrak{o}/\mathfrak{p}] < n = [K : k]$. One easily observes that this is a workable definition for arithmetical problems.


²⁶ For the pertaining notations and definitions see the paper mentioned in footnote 1.

²⁷ See paper mentioned under * and an additional note in the same journal.

²⁸ H. Hasse and F. K. Schmidt, "Die Struktur diskret bewerteter Körper," Crelle, vol. 170 (1933).

²⁹ N. Jacobson, "Totally disconnected locally compact rings," American Journal of Mathematics, vol. 58 (1936).

More important are the consequences of the non-discreteness of B(k) for 
the algebraic theory of normal division algebras. We made ample use of the 
fact that each division algebra which possesses a residue algebra whose center 
is separable over k can be represented as the product of a ramified and an 
unramified algebra 


$$D \sim (a(\sigma,\tau), U, \Gamma(U, k)) \sim (\pi^{c(\sigma,\tau)}, U, \Gamma(U, k)) \times (\eta(\sigma,\tau), U, \Gamma(U, k)).$$


For general value groups B(k) of rank one such a decomposition is not obvious, for there do not exist generating prime elements of the prime ideal belonging to the valuation B(k). Nevertheless, under restricting conditions on the structure of the field k we are able to find a substitute for such decompositions; but it turns out that such decompositions can no longer be made unique, as was possible in the case of discrete valuations by fixing the prime element π. Let us assume that there exists at least one discrete perfect subfield k′ of k which has the same field of residue classes $\bar{k}$ as the given field k, and that there exists a (necessarily infinite) algebraic extension k″ of k′ which is everywhere dense in k, that is to say whose derived field with respect to the valuation is equal to k. The values assumed by the elements of k″ form a group of rational numbers B_r(k″) whose structure is determined by the G-degree of k″ over k′. In order to see this one has to observe that any number of k″ lies in a finite perfect extension of k′, that its value is determined by the relative degree of that extension, and that the value does not depend upon the particular finite field used for its determination. Analyzing the structure of the group B_r(k″) we see that certain elements of it may possess arbitrarily high powers of primes as denominators. The set of such rational primes shall be called the infinite characteristic of B_r(k″); it is determined by the infinite part of the G-degree [k″ : k′]; we collect all these primes q in a formal product $\prod_{(q)} q^{\infty} = G$.
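
The notion of infinite characteristic can be illustrated numerically. The sketch below rests on assumptions of our own: the value of a prime element of k′ is normalized to 1, and a hypothetical tower is considered in which the values $1/q^{\nu}$ occur for the primes q = 2, 3 and $\nu$ up to a chosen depth. The helper names are illustrative and not from the paper.

```python
from fractions import Fraction


def power_of(q, n):
    """Exponent of the prime q in the positive integer n."""
    e = 0
    while n % q == 0:
        n //= q
        e += 1
    return e


def layer_values(primes, depth):
    """Hypothetical values 1/q**nu (nu = 1..depth) occurring in a finite layer of the tower."""
    return [Fraction(1, q ** nu) for q in primes for nu in range(1, depth + 1)]


def denominator_exponents(values, primes):
    """For each prime q, the highest power of q found in a denominator among the values."""
    return {q: max(power_of(q, v.denominator) for v in values) for q in primes}


if __name__ == "__main__":
    for depth in (1, 2, 4, 8):
        # The exponents of 2 and 3 grow with the depth, the exponent of 5 stays 0.
        print(depth, denominator_exponents(layer_values((2, 3), depth), (2, 3, 5)))
```

In this toy tower the primes 2 and 3 would belong to the infinite characteristic, since their exponents in the denominators are unbounded as the depth increases, while 5 would not.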
Now it can be shown that each algebra D over k whose algebra of residue classes has a separable center over $\bar{k}$ is equivalent to the direct product of a suitable algebra D″ over k″ with k.³⁰ Thus the study of normal algebras over k is reduced to the investigation of the normal algebras over k″.
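Now let D″ be an arbitrary division algebra over k″,

$$D'' \sim (a(\sigma,\tau)'', U'', \Gamma(U'', k'')),$$

$$U'' = k''(A'') \text{ where } (A'')^n + a''_1 (A'')^{n-1} + \cdots + a''_n = 0 \text{ with } a''_i \text{ in } k''.$$

Consider now an arbitrary finite algebraic extension $k_*$ of k′ which contains the elements $a(\sigma,\tau)''$ and $a''_i$. Then $U'' = k_*(A'')k''$ and

$$D'' \sim (a(\sigma,\tau)'', k_*(A'')/k_*, \Gamma) \times k'' \sim D_* \times k''.$$

Since $k_*$ is a finite algebraic extension of the discrete perfect field k′ there exists a prime element π in $k_*$, hence

$$D_* \sim (\pi^{c(\sigma,\tau)}, k_*(A'')/k_*, \Gamma) \times (\eta(\sigma,\tau), k_*(A'')/k_*, \Gamma),$$

where the first factor represents a ramified algebra and the second factor stands for an unramified algebra. The algebras

$$(\pi^{c(\sigma,\tau)}, k_*(A'')/k_*, \Gamma) \times k'', \qquad (\eta(\sigma,\tau), k_*(A'')/k_*, \Gamma) \times k''$$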


are at most similar to a ramified or an unramified division algebra over k″ respectively, where the ramification is understood to be measured according to the new definition. The resulting decomposition of D″ depends essentially upon the choice of the subfield $k_*$; nevertheless the distinction between the ramified and unramified part still persists.


³⁰ Cf. Note 1. The result essentially used here can be generalized as follows: "If k″ is an everywhere dense subfield of k such that the residue class fields of k″ and k coincide, every separable extension k(A) of degree n over k is equal to the join of k with a suitable separable extension k″(A″) of degree n over k″." A proof of this theorem can be obtained by generalizing a proof of M. Moriya in "Klassenkörpertheorie im Kleinen für die unendlichen algebraischen Zahlkörper" (Journal of the Fac. of Sci. Hokkaido Imp. Univer., ser. I, vol. 5 (1936), Sapporo, Japan). Using the notations loc. cit., p. 13, we observe that one can take instead of k any everywhere dense subfield k″ of k. Furthermore, in the general case one has to use for the construction of the equation for A″ the theory of abstract derivation as developed by H. Hasse in "Noch eine Begründung der Theorie der höheren Differentialquotienten in einem algebraischen Funktionenkörper einer Unbestimmten" (Crelle, vol. 177 (1937)). The proof that the constructed equation for A″ is irreducible in k″ resp. k can be reduced to a theorem of F. K. Schmidt stating that two polynomials of degree n whose distance is sufficiently small in the metric space of all polynomials of degree n have the same type of decomposition. Cf. F. K. Schmidt, "Mehrfach perfekte Körper" in Mathematische Annalen, vol. 108 (1933). Our theorem is rather helpful for the investigation of the structural theory of general perfect fields whose value groups are isomorphic with non-discrete subgroups of the additive group of all real numbers.


k.(A’’) / kes, x —~ a) x 


where $Z_*$ denotes the cyclic unramified subfield of $k_*(A'')$ belonging to the character induced by $c(\sigma,\tau)$. The cyclic field $Z'' = Z_*k''$ can also be described as the subfield of U″ which is determined by the abelian character of Γ singled out by the values of the original factor set $a(\sigma,\tau)''$ within B(k) or B(k″), for the exponents $c(\sigma,\tau)$ are uniquely determined modulo [U″ : k] within $B(k_*)$ or B(k″).

If the prime r does not divide G, then we can select the field $k_*$ used for the construction of a division algebra D″ whose degree is a power of r such that the relative degrees of all fields between $k_*$ and k″ are relatively prime to r. Hence k″ does not split the related algebra $D_*$ over $k_*$, of course provided that such an algebra exists.³¹ In such a case the algebra D″ is represented as the direct product of a ramified and an unramified algebra as we observed before.

Now let q be a divisor of G, and let $D_* = (\pi, Z_*, \sigma)$ be a division algebra over $k_*$. Here we can no longer assume that the relative degrees of all fields between $k_*$ and k″ are relatively prime to q. Let K be a sufficiently large extension of $k_*$ such that $[K : k_*] = q^{\nu}s$ is divisible by $q^{\nu}$. Then


where Π denotes a prime element of the field K and where $[Z_*K : K] = q^{\nu}$. Hence the algebra $D_* \times K$ and a fortiori the algebra $D_* \times k''$ are equivalent to unramified algebras over K and k″ respectively. Thus we see that there do not exist ramified division algebras D″ over k″ whose degree is a power of a prime q dividing the infinite characteristic G. Observe that there may very well exist unramified division algebras over k″ whose degree is a power of such a prime q.

We next wish to construct an example of a perfect field k whose value group B(k) is not discrete and where all normal algebras of degree n form a cyclic group of order n. Let k be a finite p-adic number field; then all classes of algebras which can be represented by normal division algebras whose degrees are divisors of n form a cyclic group of order n for each integer n. Consider the field k′ = k{t} of all formal power series in t which possess coefficients in the field k. Then adjoin to k′ all radicals $t^{1/q^{\nu}}$ where q runs over all rational primes and $\nu = 1, 2, \cdots$. The resulting enumerable infinite extension k″ possesses an infinite characteristic G which is divisible by all rational primes. According to what we have seen before all division algebras of degree n are unramified and consequently they are determined by the division algebras D over k; since k is algebraically perfect there can be established a one to one correspondence between the algebras over k″ and k. Thus we see that all classes of algebras over k″ which can be represented by division algebras whose degrees are divisors of n form a cyclic group of order n. The same is true for the classes of normal algebras over the derived field belonging to k″. One readily observes that the local class field theory in the form of the isomorphism theorem stating the isomorphism of the Galois group of an abelian extension A over k with the respective norm class group in k does not hold. Thus we observe that the discreteness of the value group B(k) is essential for the validity of all theorems known in the usual local class field theory.


³¹ For a more detailed treatment of the perfect fields arising from infinite algebraic number fields see

As a matter of fact one can readily construct examples of fields k which are perfect with respect to a non-discrete valuation of rank one (or of fields which are everywhere dense in such fields) such that certain algebras are ramified and others are unramified according to the respective degrees. One has to consider suitable infinite perfect fields k for which the local class field theory holds in part, i.e. such that all algebras whose degrees divide a fixed G-number form cyclic groups; then one adjoins to k{t} sufficiently many algebraic quantities such that the infinite characteristic of the resulting field k″ is equal to the previously fixed G-number and such that the algebras over k″ whose degrees are relatively prime to the G-number form cyclic groups. The resulting field k″ has the property that all algebras whose degrees divide a fixed integer n form a cyclic group of order n. Moreover, examples of such a type show that the cyclicity of class groups of algebras over a field does not imply that the field is perfect. Repeating the process of adjoining transcendental quantities like t one can find fields which admit a non-discrete valuation of arbitrary type of ordering but such that the special property of the class groups of algebras holds.


THE JOHNS HOPKINS UNIVERSITY. 



GROUPS WHOSE COMMUTATOR SUBGROUPS ARE OF 
ORDER TWO.* 


By G. A. MILLER.


If the commutator subgroup of a group G is of order two the commutators of G are invariant and hence every operator of odd order appears in the central of G since such an operator could not be transformed into an operator of even order. It therefore results that when G involves operators of odd order it is the direct product of a group of order $2^m$ and of an abelian group of odd order. It is desirable to exclude direct products in what follows since all except one of the factor groups would be abelian. Hence we shall assume hereafter that the order of G is $2^m$ and that G is not a direct product. The central of G includes the squares of all the operators of G and hence the central quotient group of G is abelian and of type $1^n$, where n is even since the subgroup composed of all of the operators of G which are commutative with two of its non-commutative operators is of index 4 under G.
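
These statements can be checked on the two non-abelian groups of order 8. The following is a minimal sketch under our own encoding (not Miller's): an element $a^i b^j$ is stored as the pair (i, j), with $a^4 = 1$, $ba = a^{-1}b$, and either $b^2 = a^2$ (quaternion group) or $b^2 = 1$ (octic group).

```python
def make_group(quaternion):
    """Return (elements, mul) for a group of order 8 written as a^i b^j.

    Relations: a^4 = 1, b a = a^-1 b, and b^2 = a^2 (quaternion group)
    or b^2 = 1 (octic group).
    """
    elements = [(i, j) for i in range(4) for j in range(2)]

    def mul(x, y):
        (i, j), (k, l) = x, y
        exponent = i + (-k if j else k)   # move b^j past a^k
        if j and l and quaternion:        # b * b = a^2 in the quaternion group
            exponent += 2
        return (exponent % 4, (j + l) % 2)

    return elements, mul


def analyse(elements, mul):
    """Return the set of commutators, the central, and the set of squares."""
    identity = (0, 0)
    inverse = {x: next(y for y in elements if mul(x, y) == identity)
               for x in elements}
    commutators = {mul(mul(inverse[x], inverse[y]), mul(x, y))
                   for x in elements for y in elements}
    central = [z for z in elements
               if all(mul(z, g) == mul(g, z) for g in elements)]
    squares = {mul(x, x) for x in elements}
    return commutators, central, squares


if __name__ == "__main__":
    for name, flag in (("quaternion", True), ("octic", False)):
        elements, mul = make_group(flag)
        commutators, central, squares = analyse(elements, mul)
        print(name, sorted(commutators), sorted(central), sorted(squares),
              len(elements) // len(central))
```

In both groups the commutators form a group of order 2, every square lies in the central, and the central quotient group has order $4 = 2^2$, in agreement with the evenness of n.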

It is possible to construct as follows a G whose central is an arbitrary abelian group. If $t_1, t_2, \cdots, t_l$ is a set of restricted independent generators of this abelian group we divide these operators into distinct pairs when $l$ is even or we divide $l - 1$ of them into distinct pairs when $l$ is odd. In the former case we construct two operators whose squares are the operators of such a pair and such that one of these two operators generates the commutator subgroup. Each of these two operators may be assumed to transform the other into itself multiplied by the commutator of order 2. The remaining pairs of independent generators may be assumed to be such that none of them generates the commutator subgroup but that each of the operators of a pair is the square of an operator of G and that two such operators are again non-commutative but are commutative with all of the other operators thus constructed.

We thus arrive at a G which has the given abelian group for its central 
and is not a direct product of two groups since its central is generated by the 
squares of its operators. When l is odd we may proceed similarly with the
exception that the operator which does not appear in a pair may be assumed 
to be the only one of the set of independent generators which separately 


* Received Aug. 3, 1937. 




102 G. A. MILLER. 


generates the commutator subgroup and is a non-square. This group is again 
not a direct product since if it were a direct product one of the factor groups 
would be abelian and would not involve the commutator of order 2. Hence 
there results the following theorem: It is possible to construct a group of 
order $2^m$ which has an arbitrary abelian group whose order is of this form as
its central and has a commutator subgroup of order 2 but is not the direct 
product of two groups. Whenever two G’s have centrals which are not simply 
isomorphic the G’s are not simply isomorphic but there are also non-simply 
isomorphic G’s which have simply isomorphic centrals. 

The G's which have for their common central the group of order 2 are identical with the non-abelian groups which separately satisfy the condition that the squares of their operators constitute the group of order 2. It is known that there are three infinite systems of such groups.¹ We proceed to determine the G's which have for their centrals the group of type $1^n$, where n > 1, and shall first impose the additional condition that each of the operators of order 2 contained in such a group is invariant. Let $s_1$ and $s_2$ represent two non-invariant operators of order 4. The group generated by $s_1$ and $s_2$ may be of order 8, 16, or 32. When it is of order 8 it is the quaternion group. When it is of order 16 it contains 12 operators of order 4 which have two distinct squares and hence it is completely determined. When it is of order 32 the squares of $s_1$ and $s_2$ do not generate the commutator subgroup and hence it is again completely determined.
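
The description of the group of order 16 can be verified by direct enumeration. The sketch below assumes that this group has the presentation $a^4 = b^4 = 1$, $ba = a^{-1}b$, with $s_1 = a$ and $s_2 = b$; this presentation is our reading and is not stated in the text. An element $a^i b^j$ is stored as the pair (i, j).

```python
from itertools import product

ELEMENTS = list(product(range(4), repeat=2))   # (i, j) stands for a^i b^j
IDENTITY = (0, 0)


def mul(x, y):
    """Product in the group a^4 = b^4 = 1, b a = a^-1 b (an assumed presentation)."""
    (i, j), (k, l) = x, y
    sign = -1 if j % 2 else 1                  # odd powers of b invert a
    return ((i + sign * k) % 4, (j + l) % 4)


def order(x):
    """Order of the element x, found by repeated multiplication."""
    n, y = 1, x
    while y != IDENTITY:
        y = mul(y, x)
        n += 1
    return n


def summary():
    """Count of operators of order 4, their squares, and whether order-2 operators are invariant."""
    order4 = [x for x in ELEMENTS if order(x) == 4]
    order2 = [x for x in ELEMENTS if order(x) == 2]
    squares_of_order4 = {mul(x, x) for x in order4}
    central = {z for z in ELEMENTS
               if all(mul(z, g) == mul(g, z) for g in ELEMENTS)}
    return len(order4), squares_of_order4, set(order2) <= central


if __name__ == "__main__":
    print(summary())
```

Under this assumption the enumeration gives twelve operators of order 4 whose squares are $a^2$ and $b^2$, and every operator of order 2 is invariant, which matches the conditions imposed in the paragraph above.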

Suppose now that G contains the quaternion group. Its subgroup composed of its operators which are commutative with all the operators of this quaternion group cannot involve any operator whose square is the commutator of order 2. If this subgroup involves two non-commutative operators of order 4 they therefore generate the given group of order 32. This is also the case when the subgroup of index 4 under this subgroup composed of all its operators which are commutative with each of these two non-commutative operators involves two non-commutative operators of order 4, etc. By continuing this process we finally arrive, when G is of finite order, at an abelian subgroup of type $1^n$, n being of the form 2m + 1, where m + 1 represents the number of these successive steps. Such a system can readily be constructed by starting with the abelian group of type $1^{2m+1}$ and inserting the quaternion group at an arbitrary stage of the process. The order of G is $2^{4m+3}$, where m is an arbitrary positive integer or zero. This system of groups is characterized by the following properties. Each group has the group of order 2 for its commutator subgroup, involves the quaternion group, contains only operators of order 2 and 4 besides the identity, every operator of order 2 is in the central and every operator of order 4 is non-invariant; there is one and only one group for a given positive or zero value of m.


¹ G. A. Miller, American Journal of Mathematics, vol. 55, pp. 417-420.

When G does not involve two non-commutative operators of order 4 which generate the quaternion group but contains two such operators which generate the given group of order 16 then we can proceed similarly to construct an infinite system of groups which satisfy the conditions in question. In this case we arrive at the abelian group of type $1^{2m+2}$, and each of the successive groups, after the first, is again of order 32. This system is characterized just as the preceding one except that the quaternion group therein is replaced by the given group of order 16 and the order of G is $2^{4m+4}$. Finally, when every two non-invariant operators of G generate the given group of order 32 there results the third and last system of groups which are separately characterized by the facts that each group has the group of order 2 for its commutator subgroup, contains no operator whose order exceeds 4, all of its operators of order 2 are in its central while each of its operators of order 4 is non-invariant. The order of each of these groups is $2^{4m+5}$, where m is a positive integer or zero, and there is one and only one such group for every such value of m. That is, there are three and only three infinite systems of groups which separately satisfy the three conditions that each of them has the group of order 2 for its commutator subgroup, contains no operator whose order exceeds 4, all its operators of order 2 are in its central but none of its operators of order 4 appears therein.

We proceed to consider the groups which are characterized by the conditions that their centrals are of type $1^n$, n > 1, and that they separately involve non-invariant operators of order 2. Such a G is generated by its operators of order 4 since these operators could not generate a proper subgroup such that each of the remaining operators is of order 2, since this proper subgroup would be abelian and its operators would be transformed into their inverses by the remaining operators of G. When all the operators of order 2 contained in G are relatively commutative they generate an invariant abelian subgroup of G. As G is supposed to contain operators of order 2 which are not commutative with all of its operators of order 4 it results that it contains two non-commutative operators of order 4 which generate a group of order 16 involving non-invariant operators of order 2. Hence the following theorem: If the central of a group whose commutator subgroup is of order 2 is of type $1^n$ and if this group involves non-invariant operators of order 2 but all of its operators of this order are relatively commutative then it contains invariantly the group of order 16 which is characterized by the fact that it contains exactly eight operators of order 4 which have two distinct squares.

When the operators of order 2 contained in G generate an abelian sub- 
group then G can be constructed by successively extending this subgroup by 
operators of order 4 which are commutative with half of the operators of this 
subgroup and whose squares do not include the commutator of order 2 con- 
tained in G. This process of extending this subgroup and the resulting group 
is continued until we arrive at a group whose central is generated by the 
squares of the added operators of order 4. The extending operator in each 
case has a square which does not appear in the group of the squares of the 
operators of order 4 previously constructed. If the original abelian subgroup 
is of order $2^k$ the number of these extensions is k − 1 and the central of each
extended group is half of the central of the preceding group. The smallest 
group in this system is the group of order 16 noted at the close of the preceding 
paragraph, and the central of each of these groups contains one and only one 
operator of order 2 which is not a square, namely the commutator of this order. 
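
The smallest group of this system can also be examined by enumeration. The sketch assumes the presentation $a^4 = b^2 = c^2 = 1$, b invariant, $ca = abc$, which is our own choice of a group of order 16 with the stated properties and is not given in the text. An element $a^i b^j c^k$ is stored as the triple (i, j, k).

```python
from itertools import product

ELEMENTS = list(product(range(4), range(2), range(2)))   # a^i b^j c^k
IDENTITY = (0, 0, 0)


def mul(x, y):
    """Product under a^4 = b^2 = c^2 = 1, b invariant, c a = a b c (assumed presentation)."""
    (i, j, k), (i2, j2, k2) = x, y
    # Moving c^k past a^i2 contributes the factor b^(k * i2).
    return ((i + i2) % 4, (j + j2 + k * i2) % 2, (k + k2) % 2)


def order(x):
    """Order of the element x, found by repeated multiplication."""
    n, y = 1, x
    while y != IDENTITY:
        y = mul(y, x)
        n += 1
    return n


def summary():
    """Data on operators of order 4 and of order 2 in the assumed group."""
    order4 = [x for x in ELEMENTS if order(x) == 4]
    order2 = [x for x in ELEMENTS if order(x) == 2]
    central = {z for z in ELEMENTS
               if all(mul(z, g) == mul(g, z) for g in ELEMENTS)}
    squares_of_order4 = {mul(x, x) for x in order4}
    has_non_invariant_order2 = any(x not in central for x in order2)
    order2_commute = all(mul(x, y) == mul(y, x) for x in order2 for y in order2)
    return len(order4), squares_of_order4, has_non_invariant_order2, order2_commute


if __name__ == "__main__":
    print(summary())
```

With this presentation there are exactly eight operators of order 4, with the two squares $a^2$ and $a^2b$; some operators of order 2 are non-invariant, and all operators of order 2 are relatively commutative.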

Suppose that a group whose commutator subgroup is of order 2 and whose 
central is of type $1^n$ is generated by its operators of order 2. Two non-commutative
operators of order 2 contained in such a G generate the octic
group. The subgroup of G composed of all its operators which are commu- 
tative with every operator of this octic group is also generated by its operators 
of order 2 since G is not a direct product. Hence there results the theorem 
that if a group whose commutator subgroup is of order 2 and whose central 
is of type $1^n$ is not a direct product it belongs to the infinite system of groups
which is characterized by the facts that more than half of the operators of each 
group are of order 2 and that the squares of the operators of each group con- 
stitute the group of order 2. It therefore results that the groups under 
consideration are not separately generated by their operators of order 2. 

The groups whose commutator subgroups are of order 2 and whose centrals are of type $1^n$, n > 1, therefore have the property that they involve proper subgroups generated by their operators of order 2 whenever they involve non-invariant operators of this order. Such a proper subgroup involves invariant operators of order 2 besides the commutator of this order and the squares of its operators of order 4, and G can be constructed by extending such a subgroup by operators of order 4 which have different squares and whose squares do not include the commutator of order 2. Such an extending operator is non-commutative with an invariant operator of order 2 of this subgroup which is not the commutator nor a square. If the resulting group contains an invariant operator of order 2 which is not a commutator or a square this extension is repeated until the resulting group contains no invariant operator of order 2 besides the commutator of this order and those which are squares. By these methods all the groups whose centrals are of type $1^n$, n > 1, and whose commutator subgroups are of order 2 can be constructed.

It was noted above that the squares of all the operators of G appear in the central of G. When the commutator of order 2 in G is the square of an invariant operator of G then the squares of the operators of G generate a subgroup composed of operators which are squares. This results from the fact that if s and t represent any two operators of such a G then $s^2t^2$ is the square of an operator of G, since when s and t are non-commutative then $s^2t^2$ is the square of st multiplied by an invariant operator of order 4 which generates the commutator of order 2. The central of G may contain operators whose order exceeds 2 which are not squares but generate the commutator of order 2. It may also contain operators which are not squares and do not generate the commutator of order 2. In the latter case such an operator generates also the square of a non-invariant operator of G since G would otherwise be a direct product. Hence there results the following theorem: If the central of a group whose commutator subgroup is of order 2 contains operators which are not squares of other operators of the group then such an operator either generates the commutator of order 2 or the square of a non-invariant operator whose order exceeds 2.
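
The computation behind the statement on products of squares may be written out as follows. The symbols c (the commutator of order 2) and z (an invariant operator with $z^2 = c$) are introduced here for illustration only; the case of non-commutative s and t is treated, so that $ts = stc$.

```latex
\begin{align*}
(st)^2 &= s(ts)t = s(stc)t = s^2t^2c
  && \text{since } c \text{ is invariant},\\
(stz)^2 &= (st)^2z^2 = s^2t^2c \cdot c = s^2t^2
  && \text{since } z \text{ is invariant},\ z^2 = c,\ c^2 = 1.
\end{align*}
```

Thus $s^2t^2$ is the square of the operator stz. When s and t are commutative the same conclusion is immediate, since then $s^2t^2 = (st)^2$.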

If s is a non-invariant operator of lowest order in G then s can not
generate the commutator of order 2 if G contains an operator of larger order
which generates this commutator. If there is another operator in G of the
same order as s which is non-commutative with s then this operator can not
generate the commutator of order 2 in G if its order exceeds 4. Hence it
results that we can select a set of operators of G in the following manner: 
The first two are non-commutative operators of lowest order in G, the second 
two are non-commutative operators of lowest order in the subgroup of index 4 
composed of all the operators of G which are commutative with the first two 
selected operators, etc. Then if one of these operators generates the com- 
mutator of order 2 contained in G none of the other operators of the set can 
have this property unless the two operators are of order 4. 

From what precedes it results that the groups whose commutator sub- 
group is of order 2 can be divided into two categories. In one of these the 
commutator of order 2 is generated only by operators in the central while in 
the other this commutator is generated by some non-invariant operator. In 
the latter case a set of independent generators of each of the groups can be so 
selected that one and only one of them generates the commutator of order 2 
unless two of them appear in the quaternion group. In this case none of the 
remaining operators of the set generates the commutator of order 2. Hence
the construction of these groups is to a large extent reduced to the construction
of groups generated by two non-invariant operators in addition to the central. 
Therefore we proceed to consider this special case. 

If these two non-invariant operators are in the quaternion group then the 
central may be any cyclic group which generates the commutator of order 2 
but it can be no other group. When the order of this cyclic group exceeds 2 
the group is also generated by two non-invariant operators which appear in 
the octic group together with the same central cyclic group. When each of 
the two non-invariant generating operators has an order which exceeds 4 and 
one of them generates the commutator of order 2 the central may be an arbi- 
trary abelian group having at most two invariants, which are at least equal 
respectively to half the orders of these two independent generators. Finally, 
when neither of the two non-invariant generating operators generates the 
commutator of order 2 the central may be an arbitrary abelian group having
at most three invariants. One of the three independent generators of the central, 
in the case that there are three such generators, then generates the commutator 
of order 2 and has an arbitrary order while the orders of the other two are 
respectively at least equal to one-half of the orders of the two non-invariant
generators of G.


UNIVERSITY OF ILLINOIS. 


CONVERGENCE OF A SEQUENCE OF LINEAR 
TRANSFORMATIONS.* 


By M. H. INGRAHAM and M. C. WOLF.¹


The purpose of the present paper is to study conditions on a sequence $\{A_i\}$ of n × n matrices with elements in the complex field, such that if an infinite sequence of linear transformations with matrices $\{A_i\}$ is applied to any bounded region of an n-dimensional vector space, the region converges uniformly to the origin. Certain phases of the problem are discussed in general for n-space and carried out in detail for n = 2.
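
The kind of convergence in question can be observed numerically. The following is a minimal sketch for n = 2 with real matrices only; the particular sequence of nearly equal matrices is hypothetical and chosen by us, and the unit circle is merely sampled rather than treated exactly.

```python
import math


def apply(matrix, vector):
    """Apply a 2 x 2 matrix, given as two rows, to a vector."""
    (a, b), (c, d) = matrix
    x, y = vector
    return (a * x + b * y, c * x + d * y)


def max_length_after(matrices, samples=360):
    """Largest length reached by sampled unit vectors after applying the matrices in order."""
    worst = 0.0
    for s in range(samples):
        angle = 2 * math.pi * s / samples
        v = (math.cos(angle), math.sin(angle))
        for m in matrices:
            v = apply(m, v)
        worst = max(worst, math.hypot(*v))
    return worst


def nearly_equal_sequence(count):
    """Hypothetical matrices close to [[0.5, 1], [0, 0.5]], perturbed slightly at each step."""
    return [((0.5 + 0.01 * math.sin(i), 1.0), (0.0, 0.5 - 0.01 * math.cos(i)))
            for i in range(count)]


if __name__ == "__main__":
    sequence = nearly_equal_sequence(60)
    for n in (1, 5, 20, 60):
        print(n, max_length_after(sequence[:n]))
```

A single one of these matrices lengthens some unit vectors, yet the products of many of them carry the whole unit circle toward the origin; this is the behavior for which the paper seeks conditions.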

The work on this paper was suggested by a problem in the theory of genetics.²


I. Sufficient conditions in the general case. An n-dimensional vector ξ after the application of a linear transformation with n × n matrix A is the vector Aξ of length $\sqrt{\bar{\xi}'\bar{A}'A\xi}$ with direction components which are in general different from those of ξ. If $A\xi_i = \lambda_i\xi_i$, then $\xi_i$ is an invariant direction (characteristic direction or vector) of the transformation with matrix A, and is said to correspond to the characteristic value $\lambda_i$. In general, for convenience, the vectors will be taken unitary.
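
These definitions can be tried on a small example. The sketch below uses an illustrative 2 × 2 complex matrix of our own choosing, and the helper names are not from the paper; it computes the length of Aξ and tests whether a unitary vector is a characteristic direction.

```python
import math


def mat_vec(a, v):
    """Product of a square matrix (list of rows) with a vector."""
    return [sum(a[r][c] * v[c] for c in range(len(v))) for r in range(len(a))]


def length(v):
    """Length of a complex vector: square root of the sum of conj(x) * x."""
    return math.sqrt(sum((x.conjugate() * x).real for x in v))


def unitary(v):
    """Scale a nonzero vector to length one."""
    n = length(v)
    return [x / n for x in v]


def is_characteristic_direction(a, v, value, tol=1e-12):
    """True if a v equals value * v up to the tolerance tol."""
    image = mat_vec(a, v)
    return all(abs(image[r] - value * v[r]) <= tol for r in range(len(v)))


if __name__ == "__main__":
    # Illustrative matrix with characteristic values 2 and i.
    A = [[2 + 0j, 1 + 0j], [0j, 1j]]
    xi = unitary([1 + 0j, -2 + 1j])
    print(is_characteristic_direction(A, xi, 1j))
    print(length(mat_vec(A, xi)))   # equals |i| = 1 for this unitary direction
```

For a unitary characteristic direction the length of its image is the absolute value of the corresponding characteristic value, which the last line of the usage confirms for this matrix.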


* Presented to the American Mathematical Society, April 9, 1937. Received by the 
Editors, July 15, 1937. 

¹ The work of M. C. Wolf was supported by a grant from the Wisconsin Alumni Research Foundation.

² For a purely Mendelian case of a characteristic determined by one gene, there are three possible types: 1) homozygous dominant, 2) heterozygous, and 3) homozygous recessive. The proportions of these three types in a population can be represented by a point in a plane, or in the case of n genes by a point in space of 2n dimensions.

Under certain hypotheses as to selective mating and productivity the coördinates of the point representing one generation can be given as quotients of quadratic forms in the coördinates of the point representing the preceding generation.

Invariant points represent conditions of equilibrium. These may be of stable or 
unstable type depending on whether or not the invariant point is the limit of the 
iterate of the transformation operating on all neighboring points. This may be studied 
by means of the iterates of the Jacobian of the transformation at the invariant point. 
As the transformations for any case but that of an “ infinite population ” vary slightly 
from generation to generation within very small limits, a better picture of the actual 
case is obtained by studying the products of a finite number of nearly equal transforma- 
tions taken from an infinite sequence of such transformations rather than the iterates 
of a single transformation. 




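When $\bar\xi_1\xi_1 = \bar\xi_2\xi_2 = 1$, then $|\bar\xi_1\xi_2| \le 1$. Hence for any matrices $R$ and
$S$ and any unitary vector $\xi$, it follows that for all unitary $\eta$,


(1) $|\bar\xi RS\xi| \le \sqrt{\max_\eta \bar\eta R\bar R'\eta}\;\sqrt{\max_\eta \bar\eta\bar S'S\eta}$.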


If $A$ is an $n \times n$ matrix with characteristic values $\lambda_i$ $(i = 1, 2, \cdots, n)$,
and if the columns of a non-singular matrix $T$ are a set of corresponding
characteristic directions $\xi_1, \xi_2, \cdots, \xi_n$, then $A = TBT^{-1}$ where


$B = \begin{pmatrix} \lambda_1 & & 0 \\ & \ddots & \\ 0 & & \lambda_n \end{pmatrix}$.


If $\bar\xi\xi = 1$, and if $|\lambda_1| \ge |\lambda_i|$ $(i = 1, 2, \cdots, n)$, then $\bar\xi\bar B'B\xi \le N(\lambda_1)$, where
$N(\lambda_1)$ is the norm of $\lambda_1$. Since $A = TBT^{-1}$ it follows from (1) that


(2) $\bar\xi\bar A'A\xi \le N(\lambda_1)K^2$,


where $K$ is a function of the columns of $T$, and hence depends only on the
characteristic directions of $A$. $K$ may be taken greater than unity. It is
important to note that $N(\lambda_1)K^2$ is an upper bound for the Hermitian form
$\bar\xi\bar R'R\xi$ for unitary $\xi$ associated with any matrix $R$ for which the characteristic
directions are given by the above matrix $T$ and for which $N(\lambda_1)$ is the
maximum norm of the characteristic values.
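
The bound (2) can be checked numerically on a small example. The sketch below is only an illustration under stated assumptions: the text says no more than that K is a function of the columns of T, and the choice K = ||T|| ||T^{-1}|| (product of largest stretches) is an assumption made here, as are the matrices T and B and all helper names.

```python
import math

def matmul(P, Q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def spectral_norm(M):
    """Largest stretch of a real 2x2 matrix: sqrt of the top eigenvalue of M'M."""
    p = M[0][0] ** 2 + M[1][0] ** 2
    r = M[0][1] ** 2 + M[1][1] ** 2
    q = M[0][0] * M[0][1] + M[1][0] * M[1][1]
    return math.sqrt((p + r) / 2 + math.hypot((p - r) / 2, q))

def inverse(M):
    """Inverse of a non-singular 2x2 matrix."""
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    return [[M[1][1] / det, -M[0][1] / det], [-M[1][0] / det, M[0][0] / det]]

# Hypothetical data: characteristic values 0.9 and 0.5,
# characteristic directions (1, 0) and (1, 1) as the columns of T.
T = [[1.0, 1.0], [0.0, 1.0]]
B = [[0.9, 0.0], [0.0, 0.5]]
A = matmul(matmul(T, B), inverse(T))

# Assumed choice of the constant K (not taken from the paper).
K = spectral_norm(T) * spectral_norm(inverse(T))
bound = 0.9 ** 2 * K ** 2          # N(lambda_1) * K^2

def squared_length_after_A(angle):
    """Squared length of A applied to the unitary vector at the given angle."""
    x, y = math.cos(angle), math.sin(angle)
    u = A[0][0] * x + A[0][1] * y
    v = A[1][0] * x + A[1][1] * y
    return u * u + v * v

worst = max(squared_length_after_A(2 * math.pi * j / 360) for j in range(360))
```

With this choice K is never less than unity, which agrees with the remark that K may be taken greater than unity.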


THEOREM 1. The unitary vectors converge uniformly toward the origin
under a sequence of transformations with matrices $\{A_i\}$ where $A_i = A + E_i$,
if $\{E_i\}$ is a sequence of $n \times n$ matrices such that $\bar\xi\bar E_i'E_i\xi \le H^2$ for all $i$ and
all unitary $\xi$, $N(\lambda_1)$ and $K$ are the constants described above, and if $H$ is
positive and satisfies the condition $[\sqrt{N(\lambda_1)} + KH] < 1$.

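Let


$\prod_{i=1}^{k} A_i = \prod_{i=1}^{k} (A + E_i) = \sum_{i=0}^{k} P_i$,


where $P_i$ is a sum of $\binom{k}{i}$ terms; each term of $P_i$ is a product of $k$ matrices in
which the matrix $A$ is a factor $k - i$ times and the remaining $i$ factors are $i$ of
the matrices $E_1, E_2, \cdots, E_k$. For example, if $k = 9$ and $i = 3$, $A^2E_3AE_5E_6A^3$
is a typical term of $P_3$. Then


$\bar\xi\,\overline{\Big(\prod_{i=1}^{k} A_i\Big)}'\Big(\prod_{i=1}^{k} A_i\Big)\xi = \bar\xi\Big(\sum_{i=0}^{k}\bar P_i'\Big)\Big(\sum_{i=0}^{k} P_i\Big)\xi = \sum_{l=0}^{2k}\bar\xi Q_l\xi$,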




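where each $Q_l$ is a sum of $\binom{2k}{l}$ products of the form $X_1X_2\cdots X_kY_1Y_2\cdots Y_k$.


Of the $X_i$, $k - s$ are $\bar A'$ and $s$ are certain of the $\bar E_i'$; $k - t$ of the $Y_i$ are $A$ and
the remaining $t$ are certain of the $E_i$; $t + s = l$. To determine a bound for
$|\bar\xi Q_l\xi|$ apply (1). Let $N(\lambda_1) = M^2$. The bound for $\bar\xi\bar A'^{p}A^{p}\xi$ as
determined from (2) is $M^{2p}K^2$; $\bar\xi\bar E_i'E_i\xi \le H^2$; $K$ enters
the bound for $\bar\xi Q_l\xi$ for every separate power $A^p$ or $\bar A'^{p}$ in $Q_l$, but the factors
$\bar A'^{p}$ in the product $X_1X_2\cdots X_k$ are separated by $\bar E_i'$'s at most $s + 1$ times
and the factors $A^p$ in $Y_1Y_2\cdots Y_k$ are separated by $E_i$'s at most $t + 1$ times,
hence it follows that if $K \ge 1$


$|\bar\xi Q_l\xi| \le K^2 \binom{2k}{l} M^{2k-l}(KH)^l$.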


For example, suppose a term of Q;, for k = 8, is 
= A’: A*E,A7E,. 
(M*K-H- H) =M"K*H® 


| < K*- 
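Clearly when $K \ge 1$

$\sum_{l=0}^{2k} |\bar\xi Q_l\xi| \le K^2(M + HK)^{2k}$.

If $K < 1$, then

$\sum_{l=0}^{2k} |\bar\xi Q_l\xi| \le (M + HK)^{2k}$.


If $(M + HK) < 1$, $\lim_{k \to \infty} (M + HK)^{2k} = 0$. Hence if $|\lambda_1| < 1$ and if
$A + E_i$ differs but slightly from $A$, that is, if $\bar\xi\bar E_i'E_i\xi \le H^2$ is sufficiently
small, the region about the origin will converge uniformly to the origin under
the sequence of transformations with matrices $\{A_i\}$.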
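
As an illustration of Theorem 1, the following sketch multiplies twenty perturbed matrices A + E_i and records the largest stretch of each partial product. It rests on assumptions that are not in the paper: A is taken diagonal so that T = I and K = 1 may be used, M = 0.5 is its largest characteristic value, and the perturbations E_i are a hypothetical alternating family whose stretch stays below H = 0.1.

```python
import math

def matmul(P, Q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def spectral_norm(X):
    """Largest stretch of a real 2x2 matrix."""
    p = X[0][0] ** 2 + X[1][0] ** 2
    r = X[0][1] ** 2 + X[1][1] ** 2
    q = X[0][0] * X[0][1] + X[1][0] * X[1][1]
    return math.sqrt((p + r) / 2 + math.hypot((p - r) / 2, q))

A = [[0.5, 0.0], [0.0, 0.3]]     # diagonal, so T = I and K = 1 (assumption)
M_const, K, H = 0.5, 1.0, 0.1    # M + K*H = 0.6 < 1

def perturbation(i):
    """Hypothetical E_i; its largest stretch is 0.05*sqrt(2), below H."""
    s = 1.0 if i % 2 == 0 else -1.0
    return [[0.05 * s, 0.05], [-0.05, 0.05 * s]]

product = [[1.0, 0.0], [0.0, 1.0]]
norms = []                 # largest stretch of A_k ... A_1 for k = 1..20
perturbation_norms = []
for i in range(1, 21):
    E = perturbation(i)
    perturbation_norms.append(spectral_norm(E))
    A_i = [[A[r][c] + E[r][c] for c in range(2)] for r in range(2)]
    product = matmul(A_i, product)
    norms.append(spectral_norm(product))
```

Each entry of `norms` stays below K (M + KH)^k, the square root of the bound on the Hermitian form, so the unit circle shrinks geometrically.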
In the remainder of this section more detailed conditions for convergence 
and divergence of vector spaces will be given. 


THEOREM 2. If $\{g_i(A)\}$ is a sequence of polynomials in the above
restricted matrix $A$, and if $\lim_{m \to \infty} N(\rho_m) = 0$ where $N(\rho_m)$ is the maximum norm
of the characteristic values of $\prod_{i=1}^{m} g_i(A)$, then any bounded portion of space
tends uniformly toward the origin under the sequence of transformations with
matrices $\{g_i(A)\}$.


This follows from (2) since the characteristic directions of $\prod_{i=1}^{m} g_i(A)$ are
those of $A$; furthermore, if $\lambda_i$ is a characteristic value of $A$, $g_j(\lambda_i)$ is a
characteristic value of $g_j(A)$ and $\prod_{j=1}^{m} g_j(\lambda_i)$ is a characteristic value of $\prod_{j=1}^{m} g_j(A)$.

COROLLARY 2.1. If $A$, restricted as above, is a matrix with characteristic
values $\lambda_i$ such that $N(\lambda_i) < 1$, then under iteration of the transformation
whose matrix is $A$, the whole space tends uniformly toward the origin, and
if $N(\lambda_i) > 1$, the space diverges.
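
A small numerical sketch of the corollary follows. The matrix is hypothetical (characteristic values 0.9 and 0.5, with a large off-diagonal entry), chosen so that a single application stretches some unitary vector even though iteration still carries everything to the origin.

```python
import math

def matmul(P, Q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def apply(X, v):
    """Image of the vector v under the matrix X."""
    return (X[0][0] * v[0] + X[0][1] * v[1], X[1][0] * v[0] + X[1][1] * v[1])

A = [[0.9, 5.0], [0.0, 0.5]]      # triangular: characteristic values 0.9, 0.5

# One application can lengthen a unitary vector considerably.
first_step = math.hypot(*apply(A, (0.0, 1.0)))

power = [[1.0, 0.0], [0.0, 1.0]]
for _ in range(200):
    power = matmul(A, power)

# The Frobenius norm bounds the length of the image of every unitary vector.
frobenius = math.sqrt(sum(entry ** 2 for row in power for entry in row))
```

The early growth is absorbed by the constant K of (2); only the characteristic values decide the limit.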


THEOREM 3. If $\{A_i\}$ is a sequence of matrices with characteristic values
$\lambda_{ij}$, the condition that $|\lambda_{ij}| < 1$, for every $i$ and $j$, is not sufficient to insure
that a bounded region in space will tend to the origin under the transformation
with matrices $\{A_i\}$.


An example of such a sequence may be constructed by iteration of two
transformations alternately. The first takes a unitary vector $\xi$ into $\eta$ where
$\bar\eta\eta = k^2 > 1$ and the second rotates $\eta$ into $l\xi$ where $k^2 > l^2 > 1$.
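
The phenomenon in Theorem 3 is easy to reproduce numerically. The sketch below does not use the stretch-and-rotate construction described above; it substitutes two hypothetical shear-like matrices, each with both characteristic values equal to 0.5, whose product nevertheless has a characteristic value greater than 4.

```python
import math

def matmul(P, Q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def spectral_radius(X):
    """Largest modulus of the characteristic values of a real 2x2 matrix."""
    tr = X[0][0] + X[1][1]
    det = X[0][0] * X[1][1] - X[0][1] * X[1][0]
    disc = tr * tr - 4 * det
    if disc >= 0:
        root = math.sqrt(disc)
        return max(abs((tr + root) / 2), abs((tr - root) / 2))
    # Complex conjugate pair: the squared modulus equals the determinant.
    return math.sqrt(det)

A1 = [[0.5, 2.0], [0.0, 0.5]]     # hypothetical; characteristic values 0.5, 0.5
A2 = [[0.5, 0.0], [2.0, 0.5]]     # hypothetical; characteristic values 0.5, 0.5
pair = matmul(A2, A1)             # one full cycle of the alternating sequence

radius_A1 = spectral_radius(A1)
radius_A2 = spectral_radius(A2)
radius_pair = spectral_radius(pair)
```

Since every second partial product is a power of `pair`, the alternating sequence sends some bounded regions off to infinity although every single factor contracts along its own characteristic direction.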

On the other hand, there exist sequences $\{A_i\}$ of matrices for which the
norm of one characteristic value for every $A_i$ is greater than unity, yet the
unitary vectors converge uniformly to the origin. The sequence which is
alternately $A_1$ and $A_2$ is an example of such when


THEOREM 4. If $\{A_i\}$ is a sequence of matrices such that

$\big|\,|A_1| \cdot |A_2| \cdots |A_m|\,\big| > 1$,


then the set of unitary vectors will not converge to the origin under the
sequence of transformations with matrices $\{A_i\}$.

Let $B_m = A_mA_{m-1}\cdots A_1$. Then

$\big|\,|B_m|\,\big| = \big|\,|A_m| \cdot |A_{m-1}| \cdots |A_1|\,\big| > 1$.


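It suffices to prove that there exists at least one vector $\eta$ such that
$\bar\eta\bar B_m'B_m\eta > 1$ for infinitely many values of $m$. Let


$\bar T_m'\bar B_m'B_mT_m = C_m = \begin{pmatrix} \sigma_1(m) & & & 0 \\ & \sigma_2(m) & & \\ & & \ddots & \\ 0 & & & \sigma_n(m) \end{pmatrix}$,


$\bar T_m'T_m = I$. The set of functional values of $\bar\zeta C_m\zeta$ is equal to those of $\bar\xi\bar B_m'B_m\xi$
for unitary $\xi$. Consider real unitary vectors $\zeta$ with coördinates $z_1, z_2, \cdots, z_n$.
Suppose $z_1 \ne 0$ and


(3) $\sigma_1(m) \ge \sigma_2(m) \ge \cdots \ge \sigma_n(m) \ge 0$.

$|C_m| = \sigma_1(m)\sigma_2(m)\cdots\sigma_n(m) > 1$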


is an increasing function of m. Hence I!C,n»l>k>1 some k and 


>Vk>1. From (3), If 2, 7/Cmy > 1. 


It follows that there exists a vector £m and an e,, where e is independent of m, 
such that > 1 and B’mBmt; > 1 for all £;- for which 


| ihe 1| 
Vidi* Cake 
The infinite sequence of vectors £m (m= 1,2,- +) has a limit vector ¢. 


Let £,; be an infinite subsequence of £m such that 


VEE: 
then PP Bit > 1 for an infinite subsequence {/,;} of {Bi}, as was to be 
proved. 
The proof of the next theorem is similar to that of Theorem 2.


THEOREM 5. If $\{A_i\}$ is a sequence of matrices such that the characteristic
values $\lambda_{ij}$ $(j = 1, 2, \cdots, n)$ for all $i$ correspond respectively to the linearly
independent invariant directions $\xi_1, \xi_2, \cdots, \xi_n$, and if $|\lambda_{ij}| < k < 1$, the
unitary vectors converge uniformly to the origin.


The point of view of this paper is intimately connected with the Cauchy-
Riemann conditions of analytic function theory. A $2 \times 2$ matrix $A = (a_{ij})$
may be interpreted as the Jacobian matrix of a transformation $U = u(x, y)$
and $V = v(x, y)$. At a point the Cauchy-Riemann conditions are that
$a_{11} = a_{22}$ and $a_{12} = -a_{21}$. If $\binom{1}{m_1}$ and $\binom{1}{m_2}$ are the invariant directions
of $A$, $m_1$ and $m_2$ satisfy the equation $z^2 + 1 = 0$ unless $a_{12} = 0$, in which case
$A$ is of the form $\lambda I$ where $\lambda$ is real. The Cauchy-Riemann conditions are
therefore equivalent to having


$\xi_1 = \binom{1}{i}$ and $\xi_2 = \binom{1}{-i}$


as characteristic vectors and $\lambda_1$ and $\lambda_2 = \bar\lambda_1$ as characteristic values. It is
clear that these conditions give conformal representation, since if


$\eta_1 = c_{11}\xi_1 + c_{12}\xi_2$ and $\eta_2 = c_{21}\xi_1 + c_{22}\xi_2$,

then

$\dfrac{\bar\eta_1\bar A'A\eta_2}{\sqrt{\bar\eta_1\bar A'A\eta_1}\,\sqrt{\bar\eta_2\bar A'A\eta_2}} = \dfrac{\bar\eta_1\eta_2}{\sqrt{\bar\eta_1\eta_1}\,\sqrt{\bar\eta_2\eta_2}}$,

which is the condition that the angle between $\eta_1$ and $\eta_2$ is preserved (except
possibly for sign). This equality follows since $\bar\xi_1\xi_2 = \bar\xi_2\xi_1 = 0$, whence also
for $i, j = 1, 2$

$\bar\eta_i\bar A'A\eta_j = N(\lambda_1)\,\bar\eta_i\eta_j$.
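
The conformal property can be verified directly for a real matrix whose entries satisfy the Cauchy-Riemann pattern, that is, equal diagonal entries and off-diagonal entries of opposite sign. The entries and the two test vectors below are hypothetical values chosen for illustration only.

```python
import math

def apply(X, v):
    """Image of the vector v under the 2x2 matrix X."""
    return (X[0][0] * v[0] + X[0][1] * v[1], X[1][0] * v[0] + X[1][1] * v[1])

def cosine(u, v):
    """Cosine of the angle between two real plane vectors."""
    return (u[0] * v[0] + u[1] * v[1]) / (math.hypot(*u) * math.hypot(*v))

a, b = 0.6, -1.1                 # hypothetical: a11 = a22 = a, a12 = -a21 = b
A = [[a, b], [-b, a]]

eta1, eta2 = (1.0, 0.3), (-0.4, 2.0)
before = cosine(eta1, eta2)
after = cosine(apply(A, eta1), apply(A, eta2))

# Every vector is stretched by the same factor sqrt(a^2 + b^2).
stretch = math.hypot(*apply(A, eta1)) / math.hypot(*eta1)
```

The common stretch factor is the modulus of the characteristic values a ± ib, which is why the quotient defining the angle is unchanged.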


II. Characteristic directions of products in real 2-space. It is seen 
that the conditions under which the whole space under successive transforma- 
tions tends toward the origin depend not only on the maximum stretch under 
any one transformation, but also upon the relations of the characteristic di- 
rections of the various transformations. It is plausible therefore that if the 
characteristic vectors of a series of matrices were nearly the same, results could 
be secured which are analogous to the case in which they are equal. Therefore 
this section is devoted to the study of the characteristic vectors and values 
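of the product of $2 \times 2$ matrices.

Suppose $A$ and $B$ are $2 \times 2$ matrices with characteristic values $\lambda_1$ and $\lambda_2$,
and $\gamma_1$ and $\gamma_2$ respectively where $|\lambda_1| \ge |\lambda_2|$ and $|\gamma_1| \ge |\gamma_2|$. Suppose
$\xi_1$ and $\xi_2$ are the characteristic directions of $A$ corresponding to $\lambda_1$ and $\lambda_2$
respectively, and $\xi_1 + \delta_1\xi_2$ and $\xi_2 + \delta_2\xi_1$ are the characteristic directions of $B$
corresponding to $\gamma_1$ and $\gamma_2$ respectively, where $\bar\xi_1\xi_1 = \bar\xi_2\xi_2 = 1$ and $\xi_1$ and $\xi_2$
are linearly independent.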


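Hence

$B\xi_1 + \delta_1B\xi_2 = \gamma_1(\xi_1 + \delta_1\xi_2)$.

Similarly

$B\xi_2 + \delta_2B\xi_1 = \gamma_2(\xi_2 + \delta_2\xi_1)$.

Therefore

$(1 - \delta_1\delta_2)B\xi_1 = (\gamma_1 - \delta_1\delta_2\gamma_2)\xi_1 + \delta_1(\gamma_1 - \gamma_2)\xi_2$.

If $\delta_1\delta_2 \ne 1$, then

$B\xi_1 = \dfrac{1}{1 - \delta_1\delta_2}\{(\gamma_1 - \delta_1\delta_2\gamma_2)\xi_1 + \delta_1(\gamma_1 - \gamma_2)\xi_2\}$.

Also

$B\xi_2 = \dfrac{1}{1 - \delta_1\delta_2}\{(\gamma_2 - \delta_1\delta_2\gamma_1)\xi_2 + \delta_2(\gamma_2 - \gamma_1)\xi_1\}$.

Suppose a characteristic vector of the product $BA$ is of the form $\xi_1 + r\xi_2$;
$r$ is then a function of $\lambda_i, \gamma_i, \delta_i$ $(i = 1, 2)$. Suppose $\sigma$ is the characteristic
value of $BA$ corresponding to $\xi_1 + r\xi_2$. Then

(4) $BA(\xi_1 + r\xi_2) = \sigma(\xi_1 + r\xi_2)$.

But

$BA(\xi_1 + r\xi_2) = \lambda_1B\xi_1 + r\lambda_2B\xi_2$

(5) $= \dfrac{1}{1 - \delta_1\delta_2}\{[\lambda_1(\gamma_1 - \delta_1\delta_2\gamma_2) + r\lambda_2\delta_2(\gamma_2 - \gamma_1)]\xi_1 + [\lambda_1\delta_1(\gamma_1 - \gamma_2) + r\lambda_2(\gamma_2 - \delta_1\delta_2\gamma_1)]\xi_2\}$.


Since $\xi_1$ and $\xi_2$ are linearly independent, it follows from (4) and (5) that


$r = \dfrac{\lambda_1\delta_1(\gamma_1 - \gamma_2) + r\lambda_2(\gamma_2 - \delta_1\delta_2\gamma_1)}{\lambda_1(\gamma_1 - \delta_1\delta_2\gamma_2) + r\lambda_2\delta_2(\gamma_2 - \gamma_1)}$.

Hence

(6) $f_1(r) \equiv \lambda_2\delta_2(\gamma_2 - \gamma_1)r^2 + \{\lambda_1(\gamma_1 - \delta_1\delta_2\gamma_2) - \lambda_2(\gamma_2 - \delta_1\delta_2\gamma_1)\}r + \lambda_1\delta_1(\gamma_2 - \gamma_1) = 0$.

Similarly if $\xi_2 + t\xi_1$ is a characteristic vector for $BA$, then


(7) $f_2(t) \equiv \lambda_1\delta_1(\gamma_1 - \gamma_2)t^2 + \{\lambda_2(\gamma_2 - \delta_1\delta_2\gamma_1) - \lambda_1(\gamma_1 - \delta_1\delta_2\gamma_2)\}t + \lambda_2\delta_2(\gamma_1 - \gamma_2) = 0$.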
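
The quadratic for r can be solved numerically and checked against the product BA written in the basis formed by the two characteristic directions of A. The sketch below is illustrative only: the function name and the sample values of the characteristic values and of the two deviations are hypothetical, and the code assumes real roots and a non-vanishing leading coefficient.

```python
import math

def product_direction(lam1, lam2, gam1, gam2, d1, d2):
    """Smaller root r of the quadratic for xi_1 + r xi_2, and its value sigma."""
    a = lam2 * d2 * (gam2 - gam1)
    b = lam1 * (gam1 - d1 * d2 * gam2) - lam2 * (gam2 - d1 * d2 * gam1)
    c = lam1 * d1 * (gam2 - gam1)
    disc = math.sqrt(b * b - 4 * a * c)      # assumes real roots and a != 0
    roots = [(-b + disc) / (2 * a), (-b - disc) / (2 * a)]
    r = min(roots, key=abs)
    sigma = (lam1 * (gam1 - d1 * d2 * gam2)
             + r * lam2 * d2 * (gam2 - gam1)) / (1 - d1 * d2)
    return r, sigma

# Hypothetical data with all characteristic values positive.
lam1, lam2, gam1, gam2, d1, d2 = 0.9, 0.5, 0.8, 0.4, 0.1, 0.05
r, sigma = product_direction(lam1, lam2, gam1, gam2, d1, d2)

# B in coordinates relative to (xi_1, xi_2); A is diag(lam1, lam2) there.
D = 1 - d1 * d2
b11, b12 = (gam1 - d1 * d2 * gam2) / D, d2 * (gam2 - gam1) / D
b21, b22 = d1 * (gam1 - gam2) / D, (gam2 - d1 * d2 * gam1) / D

# Image of xi_1 + r xi_2 under BA, compared with sigma * (1, r).
image = (b11 * lam1 + b12 * r * lam2, b21 * lam1 + b22 * r * lam2)
residual = (image[0] - sigma, image[1] - sigma * r)
```

For these values the root lies between 0 and the deviation d1, and sigma is close to the product of the two larger characteristic values.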


If only those matrices are studied for which $\lambda_i, \gamma_i, \xi_i, \delta_i$ are real, $|\delta_i| < 1$,
$|\lambda_2/\lambda_1| < k < 1$ and $|\gamma_2/\gamma_1| < k < 1$, then bounds for the characteristic
directions of the product $BA$ depend upon the relative signs of the determinants
$|B| = \gamma_1\gamma_2$ and $|A| = \lambda_1\lambda_2$.


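Case I. Suppose $\lambda_1\lambda_2 \ge 0$ and $\gamma_1\gamma_2 \ge 0$. Since $f_1(0) = \lambda_1\delta_1(\gamma_2 - \gamma_1)$
and $f_1(\delta_1) = \delta_1\gamma_2(1 - \delta_1\delta_2)(\lambda_1 - \lambda_2)$, the sign of $f_1(0)$ is the sign of $-\lambda_1\gamma_1\delta_1$
and that of $f_1(\delta_1)$ is the same as $\delta_1\lambda_1\gamma_2$. Since $\lambda_1$ and $\lambda_2$ are of the same sign,
and $\gamma_1$ and $\gamma_2$ are of the same sign, there is a root of (6) between 0 and $\delta_1$.
Similarly, there is a root of (7) between 0 and $\delta_2$.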


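Case II. If, however, $\lambda_1\lambda_2 \ge 0$ and $\gamma_1\gamma_2 \le 0$, there exists a root of (7)
between 0 and $\delta_2$ but $f_1(0)$ and $f_1(\delta_1)$ are of the same sign. Consider


$f_1(2\delta_1) = \delta_1[\lambda_1\gamma_1 + 2\lambda_2(-\gamma_2) - \lambda_1(-\gamma_2)] + K_1$,
$|K_1| < 4(k^2 + k)\,|\delta_1^2\delta_2\lambda_1\gamma_1|$.


The term $[\lambda_1\gamma_1 + 2\lambda_2(-\gamma_2) - \lambda_1(-\gamma_2)]$ bears the sign of $\lambda_1\gamma_1$. Since in
this case the sign of $f_1(0)/(\lambda_1\gamma_1)$ is the sign of $-\delta_1$, there is a root of (6) between 0
and $2\delta_1$ if $f_1(2\delta_1)/(\lambda_1\gamma_1)$ has the sign of $\delta_1$. This is the case if the values
of $\delta_1$ and $\delta_2$ are sufficiently small, so that $K_1$ will not affect the sign of $f_1(2\delta_1)/(\lambda_1\gamma_1)$.
A bound for these values of $\delta_1$ and $\delta_2$ may be found depending upon $k$ alone.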


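Case III. If $\lambda_1\lambda_2 \le 0$ and $\gamma_1\gamma_2 \ge 0$, there exists a root of (6) between 0
and $\delta_1$, but $f_2(0)$ and $f_2(\delta_2)$ are of the same sign.


$f_2(-2\delta_2) = \delta_2[3\gamma_2(-\lambda_2) + 2\lambda_1\gamma_1 - \gamma_1(-\lambda_2)] + K_2$,
$|K_2| \le 4(1 + 2k)\,|\delta_1\delta_2^2\lambda_1\gamma_1|$.

The sign of $f_2(-2\delta_2)/(\lambda_1\gamma_1)$ is that of $\delta_2$ if $\delta_1$ and $\delta_2$ are small enough so that $K_2$
does not affect the sign of $f_2(-2\delta_2)/(\lambda_1\gamma_1)$. Then there is a root of (7) between 0
and $-2\delta_2$ because the sign of $f_2(0)/(\lambda_1\gamma_1)$ is the sign of $-\delta_2$.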


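Case IV. If $\lambda_1\lambda_2 \le 0$ and $\gamma_1\gamma_2 \le 0$, consider


$f_1\!\left(\dfrac{\delta_1}{1 - k}\right) = \delta_1\left[\dfrac{\lambda_1\gamma_1 - \lambda_2\gamma_2}{1 - k} - \lambda_1(\gamma_1 - \gamma_2)\right] + K_3$,

$|K_3| < \dfrac{3k - k^2}{(1 - k)^2}\,|\delta_1^2\delta_2\lambda_1\gamma_1|$.


The sign of $f_1(\delta_1/(1 - k))/(\lambda_1\gamma_1)$ is that of $\delta_1$ if $\delta_1$ and $\delta_2$ are small enough so that $K_3$
does not affect the sign of $f_1(\delta_1/(1 - k))$. The sign of $f_1(0)/(\lambda_1\gamma_1)$ is that of $-\delta_1$,
hence there is a root of (6) between 0 and $\delta_1/(1 - k)$. The sign of $f_2(0)/(\lambda_1\gamma_1)$ is that
of $-\delta_2$. Consider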


Ai 
k) 


| K,| | 8.8.? |. 


Neglecting K,, the ee of ae ee is that of 8, hence there is a root of 


(7) between 0 and [— 


i - when 8, and 8, are sufficiently small, a bound for 
them depending ee the value of & alone. 


Since the smaller roots of (6) and (7) are such that in Cases I, II,
III, $|r| < |2\delta_1|$ and $|t| < |2\delta_2|$, and in Case IV


7% ha , when 8, and 8% are small quantities, the poms terms in 


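$r$ and $t$ of (6) and (7) may be neglected as well as quadratic terms in $\delta_1$
and $\delta_2$. Consequently $r$ is of the order of

$\dfrac{\lambda_1\delta_1(\gamma_2 - \gamma_1)}{\lambda_2\gamma_2 - \lambda_1\gamma_1}$

and $t$ is of the order of

$\dfrac{\lambda_2\delta_2(\gamma_1 - \gamma_2)}{\lambda_1\gamma_1 - \lambda_2\gamma_2}$.

Equating coefficients in equations (4) and (5), the characteristic value
corresponding to $\xi_1 + r\xi_2$ becomes

(8) $\sigma = \dfrac{\lambda_1(\gamma_1 - \delta_1\delta_2\gamma_2) + r\lambda_2\delta_2(\gamma_2 - \gamma_1)}{1 - \delta_1\delta_2}$.

Similarly the characteristic value $\rho$ corresponding to $\xi_2 + t\xi_1$ is

(9) $\rho = \dfrac{\lambda_2(\gamma_2 - \delta_1\delta_2\gamma_1) + t\lambda_1\delta_1(\gamma_1 - \gamma_2)}{1 - \delta_1\delta_2}$.

Hence for small $\delta_1$ and $\delta_2$, $\sigma$ is of the order of $\lambda_1\gamma_1$.
Also, $t\lambda_1$ is of the order of $\lambda_2\delta_2$ for small $\delta_1$ and $\delta_2$, from which it follows that
$\rho$ is of the order of $\lambda_2\gamma_2$.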
The nature of the results in Cases I-IV was arrived at geometrically.
The simplest case is illustrated by the accompanying figure. Let $OT_1$ and
$OP_1$ be the directions $\xi_1$ and $\xi_2$ corresponding to the larger and smaller roots
of $A$, $\lambda_1$ and $\lambda_2$ respectively. Let $OL_1$ and $OQ_1$ be the directions $\xi_1 + \delta_1\xi_2$ and
$\xi_2 + \delta_2\xi_1$, corresponding to the larger and smaller roots of $B$, $\gamma_1$ and $\gamma_2$
respectively. In this case $\lambda_1 > \lambda_2 > 0$ and $\gamma_1 > \gamma_2 > 0$. Under the
transformation with matrix $A$, $OT_1$ and $OP_1$ are invariant but $OL_1$ and $OQ_1$ move
into regions I and II, along the directions $OL_2$ and $OQ_2$. Under the
transformation with matrix $B$, $OT_1$ and $OP_1$ move into regions I and II along the
directions $OT_2$ and $OP_2$. From continuity considerations it follows that
invariant directions for the product $BA$ lie in regions I and II. In Cases
II-III and less simple cases of I geometric arguments were also helpful but
the diagrams were complicated by reflection caused by the occurrence of
negative $\lambda_i$ and $\gamma_i$.

[Figure: the directions $OT_1$, $OP_1$, $OL_1$, $OQ_1$ and the regions I and II.]

THEOREM 6. If $\{A_i\}$ is a sequence of second order matrices with the
characteristic values of $A_i$ equal to $\lambda_{i1}$ and $\lambda_{i2}$, where $|\lambda_{i2}/\lambda_{i1}| < k < 1$, if $\xi_1$ and $\xi_2$
are two directions in space, then for every arbitrarily chosen number $\epsilon > 0$ there
exist numbers $\delta_1 > 0$ and $\delta_2 > 0$ such that if the characteristic directions
corresponding to $\lambda_{i1}$ and $\lambda_{i2}$ lie in the ranges $\xi_1 \pm \delta_1\xi_2$ and $\xi_2 \pm \delta_2\xi_1$ respectively,
then the characteristic directions of any product of a finite sequence of these
matrices lie in the ranges $\xi_1 \pm \epsilon\xi_2$ and $\xi_2 \pm \epsilon\xi_1$. It is assumed that all
quantities under consideration are real.


Suppose it is known that for each of a sequence of matrices the
characteristic directions corresponding to $\lambda_{i1}$ and $\lambda_{i2}$ lie within the ranges $\xi_1 \pm \delta_1\xi_2$
and $\xi_2 \pm \delta_2\xi_1$ respectively. Let $\delta_1 > 0$ and $\delta_2 > 0$. The product $Q_m$ of $m$
matrices may be expressed in the form

$Q_m = N_m \cdots M_2 \cdot P_2 \cdot M_1 \cdot P_1$,

where $P_j$ is a product of matrices $A_i$ each with positive determinant; $M_j$ is a
product of matrices $A_i$ with the determinants of the first and last matrices
negative, and those determinants of the intervening matrices positive. Let
$M_j = A_{j1}T_jA_{j2}$. The determinants of $T_j$ and $M_j$ are positive. $N_m$ is either
the identity or else is a product of matrices $A_i$ with only the right-hand factor
having a negative determinant, all others being positive. Any $P_j$ or $T_j$ may
be equal to the identity.

From Case I, for every $P_j$ and $T_j$ the characteristic directions
corresponding to the larger and smaller characteristic values lie in the ranges
$\xi_1 \pm \delta_1\xi_2$ and $\xi_2 \pm \delta_2\xi_1$ respectively. When $\delta_1$ and $\delta_2$ are properly restricted,
the characteristic values of the product of $t$ of the matrices $A_i$ are approximately
$\prod_{i=1}^{t}\lambda_{i1}$ and $\prod_{i=1}^{t}\lambda_{i2}$. Then $\left|\prod_{i=1}^{t}\lambda_{i2} \Big/ \prod_{i=1}^{t}\lambda_{i1}\right| < k < 1$. It follows from Case III that
the characteristic direction corresponding to the larger characteristic value
of $T_jA_{j2}$ lies in $\xi_1 \pm \delta_1\xi_2$. Using the above theory of Cases I-IV, to compute
bounds for the invariant direction corresponding to the smaller characteristic
value it is only necessary to establish limits when the characteristic directions
of $T_j$ and $A_{j2}$ are on the boundaries of the regions, $\xi_1 \pm \delta_1\xi_2$ and $\xi_2 \pm \delta_2\xi_1$.
For example, suppose that the invariant directions of $A_{j2}$ are $\xi_1 - \delta_1\xi_2$ and
$\xi_2 + \delta_2\xi_1$ and the invariant directions of $T_j$ are $\xi_1 + \delta_1\xi_2$ and $\xi_2 - \delta_2\xi_1$. To
apply the results of Case III note that $\xi_1 + \delta_1\xi_2$ and $\xi_2 - \delta_2\xi_1$ lie in the
directions


$(\xi_1 - \delta_1\xi_2) + \dfrac{2\delta_1}{1 - \delta_1\delta_2}(\xi_2 + \delta_2\xi_1)$
and
$(\xi_2 + \delta_2\xi_1) - \dfrac{2\delta_2}{1 - \delta_1\delta_2}(\xi_1 - \delta_1\xi_2)$


respectively. Then the invariant direction of the product corresponding to
the smaller characteristic value lies between $\xi_2 + \delta_2\xi_1$ and


$(\xi_2 + \delta_2\xi_1) + \dfrac{4\delta_2}{1 - \delta_1\delta_2}(\xi_1 - \delta_1\xi_2)$,

that is between $\xi_2 + \delta_2\xi_1$ and $\xi_2 + \dfrac{5 - \delta_1\delta_2}{1 - 5\delta_1\delta_2}\,\delta_2\xi_1$. Similarly, if the other
possible bounds for the invariant directions of $A_{j2}$ and $T_j$ are considered, it
follows that the direction of the product $T_jA_{j2}$ is bounded by $\xi_2 \pm \mu_2\delta_2\xi_1$ where

$\mu_2 = \dfrac{5 - \delta_1\delta_2}{1 - 5\delta_1\delta_2}$;

$\mu_2$ is of the order of 5 if $\delta_1$ and $\delta_2$ are small.

Bounds for the characteristic directions of $M_j = A_{j1}T_jA_{j2}$ are given


through Case IV. For example, suppose the invariant direction corresponding 
to the larger root of TjAjz is —8,&, and that of Aj, is — 8,é2. Suppose 
the invariant direction corresponding to the smaller root of T;Aj- is either 
of € + p2d2;. To apply the results of Case IV, the direction ¢, — 8,é, must 
be expressed in the form 


It follows that the characteristic direction corresponding to the larger char- 
acteristic value of Aj,7'j;Aj2 lies between é, + 8,€ and €, — 7,8, where 7; is 
either 

(1 + k) (1 — k) 

(1—k) — (1+ 


or 
(145) + 




If the other bounds for the characteristic directions of $A_{j1}$ and $T_jA_{j2}$ are
considered, bounds for the characteristic directions of $M_j$ are similarly found to be


6 
1—k° 
Since the determinant of $M_j$ is positive, the characteristic directions of $M_jP_j$
lie within the larger of the two ranges given for $M_j$ and $P_j$; and since these
ranges are independent of $j$, the characteristic directions of any number of
products of $M_jP_j$ lie in the same range. It is assumed that $\delta_1$ and $\delta_2$ are
sufficiently small so that the approximation formulas hold to within the limits
necessary for the above reasoning, and so that the characteristic values of a
product of matrices closely approximate the product of the characteristic values.
If an odd number of matrices with negative determinants is included in the
product $Q_m$, that is, if the determinant of $N_m$ is negative, the characteristic
direction corresponding to the larger characteristic value of $Q_m$ may fall
outside of $\xi_1 \pm \tau_1\delta_1\xi_2$, but since the characteristic directions of $N_m$ according to
Case III are in the ranges $\xi_1 \pm \delta_1\xi_2$ and $\xi_2 \pm \mu_2\delta_2\xi_1$, it can be shown from
Case II that the characteristic directions of Q» lie in &, +0,8,€ and 
+ t2d.é,. The bound é, + is of the order of €,'+ 

Hence if ¢ is an arbitrary number, sufficient conditions for the proof of 
Theorem 6 are that dg, and & are sufficiently small to satisfy the conditions 
of Cases I-IV which depend upon alone and also 64 < = and 


& + and -+ The orders of 7, and are less than 


UNIVERSITY OF WISCONSIN. 




THE INTERRELATIONS OF THE FUNDAMENTAL SOLUTIONS OF 
THE HYPERGEOMETRIC EQUATION; LOGARITHMIC CASE.* 


By LYLE E. MEHLENBACHER.


1. The problem of this paper is to study the exact nature of the linear
relations existing between the fundamental solutions of the Hypergeometric
Differential Equation


(1) $z(1 - z)\dfrac{d^2y}{dz^2} + [\gamma - (\alpha + \beta + 1)z]\dfrac{dy}{dz} - \alpha\beta y = 0$,


in which we may consider the constants $\alpha, \beta, \gamma$ and the variable $z$ as real or
complex. The relations of this kind which exist in case each of the three
regular singular points $z = 0$, $z = 1$, $z = \infty$ is non-logarithmic in character
have already been considered by Forsyth,¹ Lindelöf² and Barnes.³ What is
proposed here is to determine the linear relations between the fundamental
solutions of (1) when one or more of these solutions becomes logarithmic in
character. We shall, for the sake of brevity, actually determine only those
linear relations existing between the fundamental solutions about the point
$z = 0$ and the solutions about the point $z = \infty$ when one of the solutions
about the latter point is logarithmic. The methods employed are readily
applied to the cases in which either the point $z = 0$, or the point $z = 1$ is
logarithmic. The complete set of interrelations in both the non-logarithmic
cases and the logarithmic cases is included in the author’s dissertation written
at the University of Michigan under the direction of Professor W. B. Ford,
and published in pamphlet form by Edwards Brothers of Ann Arbor, Michigan.


2. For purposes of future reference we shall first set down some of the 
more important relations which occur in obtaining the fundamental solutions 


* Presented to the American Mathematical Society, April 10, 1936. Received by 
the Editors, March 4, 1937. 

¹ A. R. Forsyth, A Treatise on Differential Equations, 3rd ed., London, Macmillan
and Co. (1903), pp. 203-222.

² Ernst Lindelöf, “Sur l’Intégration de l’Équation de Kummer,” Acta Societatis
Fennicae, (1), Tome 19 (1893), pp. 3-31.

³ E. W. Barnes, “A new development of the theory of the hypergeometric
functions,” Proceedings of the London Mathematical Society, Series 2, vol. 6 (1908),
pp. 141-177.




of (1) about each of its three regular singular points $z = 0$, $z = 1$, $z = \infty$
by the usual methods of the Fuchs⁴ theory.
The indicial equation of (1) corresponding to the singular point $z = 0$ is


(2) $k(k - 1) + \gamma k = 0$,
the roots of which are found to be
(3) $k_1 = 0$, $k_2 = 1 - \gamma$.


Assuming that $k_1 - k_2$ is non-integral, we shall therefore have, according to
the Fuchs theory, two fundamental solutions $Y_1$ and $Y_2$ about the point $z = 0$
having the forms

(4) $Y_1 = z^{k_1}\sum_{n=0}^{\infty} g_1(n)z^n$; $Y_2 = z^{k_2}\sum_{n=0}^{\infty} g_2(n)z^n$.


Moreover, if we put


$f(z, k) = k(k - 1)(1 - z) + k[\gamma - (\alpha + \beta + 1)z] - \alpha\beta z$,


and


$f_0(k) = [f(z, k)]_{z=0} = k(k - 1) + \gamma k$, $f_1(k) = [\partial f(z, k)/\partial z]_{z=0} = -k(k + \alpha + \beta) - \alpha\beta$,


then the same theory indicates that $g_1(n)$ will satisfy the linear recurrence
relation


(5) $g_1(n)f_0(k + n) + g_1(n - 1)f_1(k + n - 1) = 0$


in which $k$ is given the value $k_1 = 0$; while $g_2(n)$ will satisfy the same relation
with the value $k_2 = 1 - \gamma$ used for $k$.

The values of $g_1(0)$, $g_2(0)$ are arbitrary. For definiteness, let us take
$g_1(0) = g_2(0) = 1$, in which case $g_1(n)$, $g_2(n)$ become completely determined
by (5) for $n = 1, 2, 3, \cdots$.
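
The recurrence can be run in exact rational arithmetic and compared with the familiar coefficients of the hypergeometric series. The sketch below is an illustration under assumptions: the expressions used for the two coefficient polynomials are the values of f(z, k) and of its z-derivative at z = 0, and the parameter values and function names are hypothetical.

```python
from fractions import Fraction

def coefficients(alpha, beta, gamma, k, count):
    """g(0..count-1) from g(n) f0(k+n) + g(n-1) f1(k+n-1) = 0 with g(0) = 1."""
    def f0(s):
        return s * (s - 1) + gamma * s

    def f1(s):
        return -s * (s + alpha + beta) - alpha * beta

    g = [Fraction(1)]
    for n in range(1, count):
        g.append(-g[-1] * f1(k + n - 1) / f0(k + n))
    return g

def hypergeometric_coefficients(a, b, c, count):
    """Coefficients of F(a, b, c; z): ratios (a+n-1)(b+n-1) / ((c+n-1) n)."""
    out = [Fraction(1)]
    for n in range(1, count):
        out.append(out[-1] * (a + n - 1) * (b + n - 1) / ((c + n - 1) * n))
    return out

# Hypothetical parameters with gamma non-integral.
alpha, beta, gamma = Fraction(1, 2), Fraction(1, 3), Fraction(5, 4)
g1 = coefficients(alpha, beta, gamma, Fraction(0), 8)
g2 = coefficients(alpha, beta, gamma, 1 - gamma, 8)
```

With k = 0 the recurrence reproduces the series with parameters α, β, γ; with k = 1 - γ it reproduces the series with parameters α - γ + 1, β - γ + 1, 2 - γ.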

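When the $g_1(n)$, $g_2(n)$ are thus determined we find that when
$k_1 - k_2 = \gamma - 1$ is non-integral the two fundamental solutions about the
singular point $z = 0$ can be put in the forms


(6) $Y_1 = F(\alpha, \beta, \gamma; z)$, $Y_2 = z^{1-\gamma}F(\alpha - \gamma + 1, \beta - \gamma + 1, 2 - \gamma; z)$,


where the $F(\alpha, \beta, \gamma; z)$ represents the well-known Hypergeometric Function
defined by


$F(\alpha, \beta, \gamma; z) = 1 + \dfrac{\alpha\beta}{1 \cdot \gamma}z + \dfrac{\alpha(\alpha + 1)\beta(\beta + 1)}{1 \cdot 2 \cdot \gamma(\gamma + 1)}z^2 + \cdots$.


⁴ See J. Horn, Gewöhnliche Differentialgleichungen beliebiger Ordnung, Sammlung
Schubert L., Leipzig (1905), Section 34.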

The indicial equation of (1) corresponding to the point $z = \infty$ is found
to be

$k(k - 1) - (\alpha + \beta - 1)k + \alpha\beta = 0$,

the roots of which are

(8) $k'_1 = \alpha$, $k'_2 = \beta$,

after which we may show that if $k'_1 - k'_2 = \alpha - \beta$ is non-integral the two
fundamental solutions of (1) about the singular point $z = \infty$ are

(9) $Y_3 = z^{-\alpha}F(\alpha, \alpha - \gamma + 1, \alpha - \beta + 1; 1/z)$,
    $Y_4 = z^{-\beta}F(\beta, \beta - \gamma + 1, \beta - \alpha + 1; 1/z)$.

When the exponents (8) differ by an integer, that is when

(10) $\alpha - \beta = n$ = an integer $\ge 0$,

the point $z = \infty$ becomes a so-called logarithmic point. The fundamental
solutions about $z = \infty$ then take the forms

(11) $Y_5 = Y_3$ as defined in (9),
     $Y_6 = Y_5 \operatorname{Log}(1/z) + z^{-\beta}\sum_{s=0}^{\infty} h_sz^{-s}$; $h_n = 0$, $h_0 \ne 0$ unless $n = 0$.


3. It is our purpose in what follows to determine the exact forms of the
linear relations which connect $Y_1$ with $Y_5$ and $Y_6$; likewise those connecting
$Y_2$ with the same $Y_5$, $Y_6$. The methods which we shall use rest upon certain
general results obtained by Professor Ford in Chapter I of his recent book
entitled The Asymptotic Developments of Functions Defined by Maclaurin
Series.⁵ In particular, we shall employ the following General Theorem there
established:


“THEOREM. If the coefficients $g(n)$ of the power series

(A) $f(z) = \sum_{n=0}^{\infty} g(n)z^n$; radius of convergence > 0,

may be considered as a function $g(w)$ of the complex variable $w = x + iy$
and as such satisfies the two following conditions when considered throughout
any arbitrary right half plane $x > x_0$:

(a) is single-valued and analytic,
(b) is such that for all $|y|$ sufficiently large one may write

(B) $|g(x + iy)| < Ke^{\epsilon|y|}$,


⁵ Michigan Science Series, vol. 11 (1936).



where $\epsilon$ is an arbitrarily small positive quantity given in advance and where
$K$ depends only upon $x_0$ and $\epsilon$, then the function $f(z)$ defined by (A) is
analytic throughout any sector $S$ (vertex at origin) of the z-plane which does
not include the positive half of the real axis and $f(z)$ within $S$ is developable
asymptotically as follows:

(C) $f(z) \sim -\sum_{n=1}^{\infty} \dfrac{g(-n)}{z^n}$.”

We shall make use also of the following Remarks (b) and (e) relative to 
the foregoing theorem, as noted in the same work. 

“(b) In case conditions (a) and (b) of the theorem are satisfied except that
$g(w)$ has $p$ $(p \ge 1)$ singularities situated at the points $w = w_1, w_2, w_3, \cdots, w_p$,
none of which are negative integers, the theorem continues to hold true
provided one subtracts from the right member of (C) the sum of $p$ loop integrals
of the function


(D) $\dfrac{g(w)(-z)^w}{2i \sin \pi w}$.


“In case the singular points $w = w_m$ are poles, the loop integrals may
evidently be replaced by integrations of (D) over small circles, so that in such
cases the theorem continues to hold, provided that one subtracts from the right
member of (C) the sum of the residues of the function


$\dfrac{\pi g(w)(-z)^w}{\sin \pi w}$

at the various poles $w = w_m$.

“(e) The theorem may be applied to any Maclaurin series (A) in which
$g(w)$, besides satisfying condition (a), is such that we may write, when
$x > x_0$ and $|w|$ is large, $|g(w)| < K|w|^c$, where $K$ and $c$ are constants of
which the latter may be positive, negative or zero.”

Whenever we apply Remark (b) it shall be understood that for any given
value of $z = \rho e^{i\phi}$ the function $(-z)^w$ is rendered precise in meaning through
the following convention:


(12) $(-z)^w = e^{w \log(-z)} = e^{w[\log \rho + i(\phi - \pi)]}$; $0 < \phi < 2\pi$.


4. Theorem. Employing the above results, we proceed to establish the
following theorem which, so far as we have been able to determine, is new:


THEOREM. The solutions $Y_1$ and $Y_2$ defined in (6), when extended
analytically outside their circle of convergence, may be expressed linearly in
terms of the solutions $Y_5$ and $Y_6$, defined in (11), in the following forms, it
being understood throughout that $0 < \arg z < 2\pi$:


+ + 20 + (1/s) 
+ 1) 
T'(y) et 
r(2—y)[¥(a—y +1) + ¥(1—a) 4 20 + (1/s) 
4 — y) 7) 
+ 
where $\psi(a)$ is the well-known Psi-function defined as the logarithmic
derivative of the Gamma-function, that is $\psi(a) = \Gamma'(a)/\Gamma(a)$, where $C$ denotes
Euler’s constant defined by $C = -\psi(1)$, and where $\sum_{s=1}^{n} (1/s)$ becomes 0 when
$n = 0$.⁶
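
The quantities named in the theorem are easy to evaluate numerically. The sketch below approximates the Psi-function by a central difference of the logarithm of the Gamma-function, which is an assumption of convenience rather than anything used in the paper; it is adequate only for real positive arguments and moderate accuracy.

```python
import math

def psi(a, h=1e-5):
    """Central-difference estimate of d/da log Gamma(a) for real a > h."""
    return (math.lgamma(a + h) - math.lgamma(a - h)) / (2 * h)

euler_C = -psi(1.0)               # Euler's constant, defined as -psi(1)
harmonic_3 = 1 + 1 / 2 + 1 / 3    # the finite sum of 1/s for s = 1, 2, 3
psi_4 = psi(4.0)                  # should agree with -C + harmonic_3
```

The last line reflects the standard relation between the Psi-function at a positive integer and the finite harmonic sum, which is how sums of the form Σ(1/s) enter formulas of this kind.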


Proof. Returning to the recurrence relation (5), we substitute the values
of $f_0(k + n)$ and $f_1(k + n - 1)$ and, in order that the notation shall conform
to that which is customary in the theory of difference equations, we replace
$g(n)$, $g(n - 1)$ respectively by $u(x)$, $u(x - 1)$ and subsequently advance $x$
to $x + 1$. The relation (5) thus becomes


(13) $q_1(x)u(x + 1) + q_0(x)u(x) = 0$,
where

$q_1(x) = (k + x + 1)(k + x) + \gamma(k + x + 1)$,

$q_0(x) = -(k + x)(k + x + \alpha + \beta) - \alpha\beta$,


and $k$ takes the value $k_1$ or $k_2$ of (3).
For those particular values of $k$ in which we are interested, namely
$k = k_1$ and $k = k_2$ of (3), the roots of $q_1(x) = 0$ are found to be


$x = -1$, $x = -(2k + \gamma)$,
and the roots of $q_0(x) = 0$ are
(14) $r_1 = -\alpha - k$, $r_2 = -\beta - k$.


Hence we may write

$q_1(x) = (x + 1)(x + 2k + \gamma)$, $q_0(x) = -(x - r_1)(x - r_2)$.


⁶ See L. M. Milne-Thomson, The Calculus of Finite Differences, London, Macmillan
and Co. (1933), p. 245 and p. 250.


When we substitute these values into (13) and then solve the resulting first
order difference equation by elementary methods we obtain as the particular
solution $u(x)$ which takes the value 1 when $x = 0$


(15) $u(x) = \dfrac{\Gamma(x - r_1)\Gamma(x - r_2)\Gamma(2k + \gamma)}{\Gamma(-r_1)\Gamma(-r_2)\Gamma(x + 1)\Gamma(x + 2k + \gamma)}$,


in which we shall assume at first that neither $r_1$ nor $r_2$ is zero or a positive
integer, thus rendering $u(x) \ne 0$.
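
A numerical check of the particular solution is sketched below. It assumes that u(x) is the quotient of Gamma-functions Γ(x - r1)Γ(x - r2)Γ(2k + γ) / [Γ(-r1)Γ(-r2)Γ(x + 1)Γ(x + 2k + γ)], which is what the first order equation yields when k is one of the two exponents 0 and 1 - γ; the parameter values and function names are hypothetical.

```python
import math

def u(x, alpha, beta, gamma, k):
    """Assumed Gamma-quotient solution of the difference equation, u(0) = 1."""
    r1, r2 = -alpha - k, -beta - k
    return (math.gamma(x - r1) * math.gamma(x - r2) * math.gamma(2 * k + gamma)
            / (math.gamma(-r1) * math.gamma(-r2)
               * math.gamma(x + 1) * math.gamma(x + 2 * k + gamma)))

def g_by_recurrence(n, alpha, beta, gamma, k):
    """g(n) obtained by stepping q1(m) g(m+1) + q0(m) g(m) = 0 from g(0) = 1."""
    value = 1.0
    for m in range(n):
        q1 = (k + m + 1) * (k + m) + gamma * (k + m + 1)
        q0 = -(k + m) * (k + m + alpha + beta) - alpha * beta
        value *= -q0 / q1
    return value

# Hypothetical parameters with alpha - beta = 1, an integer.
alpha, beta, gamma = 1.7, 0.7, 1.3
pairs = []
for k in (0.0, 1 - gamma):
    for n in range(6):
        pairs.append((u(n, alpha, beta, gamma, k),
                      g_by_recurrence(n, alpha, beta, gamma, k)))
```

Agreement at the non-negative integers is what allows the coefficient sequence to be treated as a function of a continuous variable in the next step.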

We must notice here that since $\alpha - \beta = n$ = an integer $\ge 0$, it follows
from (14) that

$r_2 = r_1 + n$.

Now, each of the fundamental solutions $Y_1$, $Y_2$ is of the form


(16) $Y = z^k\sum_{n=0}^{\infty} g(n)z^n$; $g(0) = 1$,


where the $g(n)$ is defined by (15). The function $g(x)$ in (15) coincides with
the $g(n)$ of (16) when $x = 0, 1, 2, 3, \cdots$ and is, moreover, analytic
throughout the finite x-plane except for poles of the first order at the points


(17) $x = r_2, r_2 - 1, r_2 - 2, \cdots, r_2 - n + 1$
and poles of the second order at
(18) $x = r_1, r_1 - 1, r_1 - 2, \cdots$,


while in distant portions of the plane lying in any arbitrary right half-plane,
the same function $g(x)$ satisfies the condition described in Remark (e) quoted
above.

In order to determine the asymptotic behavior of the function $Y(z)$
defined by the Maclaurin series (16) we may therefore apply the General
Theorem quoted above, subject to Remark (b). When we do so, we see that
in our present case the $g(-x)$; $x = 1, 2, \cdots$, in formula (C) all vanish
owing to the general relation $1/\Gamma(-x) = 0$; $x = 1, 2, 3, \cdots$. Furthermore,
Remark (b) requires that we subtract from the right-hand side of (C) the
sum of the residues of the function


(19) $\dfrac{\pi g(x)(-z)^x}{\sin \pi x} = \dfrac{\pi\,\Gamma(x - r_1)\Gamma(x - r_2)\Gamma(2k + \gamma)(-z)^x}{\sin \pi x\;\Gamma(-r_1)\Gamma(-r_2)\Gamma(x + 1)\Gamma(x + 2k + \gamma)}$

at its poles (17) and (18). However, inasmuch as we are interested only in
determining the values of the constants which join the solutions $Y_1$ and $Y_2$
linearly with the solutions $Y_5$ and $Y_6$ it is sufficient for our purpose to
determine the residues of (19) at the two poles $x = r_1$ and $x = r_2$.
The residue of (19) at the first of the points (17) involves $z$ to the power
$-\beta - k$ and the residues at the remaining points of the same set involve $z$
to the lower powers $-\beta - k - 1$ to $-\beta - k - n + 1 = -\alpha - k + 1$.
It will be shown presently that the residue at the first of the poles (18) involves
$z^{-\alpha-k}$ and also $z^{-\alpha-k}\log z$, while the residues at the remaining points (18)
involve $z^{-\alpha-k-s}$ and $z^{-\alpha-k-s}\log z$ $(s = 1, 2, 3, \cdots)$. It follows that the
coefficient of the highest power of $z$ from both sources will come from the residue
at the first point of (17). In order to determine the linear relations which
we are seeking it suffices then to determine the residue of (19) at the point
$x = r_2$ and the logarithmic part of the residue of (19) at the point $x = r_1$.
The residue at the pole $x = r_2$ is found by an elementary theorem in the
calculus of residues to be

When we substitute into (20) the values of $r_1$ and $r_2$ as defined, and apply
the convention (12) to $(-z)^{r_2}$ this residue may be written as


(20) 


~p-k 
(21) C2(k, a, B, y)z T(a+k)T(y—B +) 
In order to obtain the residue of (19) at its pole of the second order
$x = r_1$ we write (19) in the form $\dfrac{M(x)}{(x - r_1)^2}$, where

(39) Q(z) 


The residue of (19) at 7, is now found to be 


(23) — M’(r,)2 — M(1r,) 2" log z. 


As we have explained above, we have at first only to determine the logarithmic 
part of this residue, which by reference to (22) is easily found to be 


T (2k + y) ett (Bek) g-a-k 
(24) C1(k, a, B, y)2-* log z = 


We may now apply the General Theorem previously quoted together with 
its accompanying remarks. In this way we arrive at the following preliminary 
result : 




The function Y defined in (16), when extended analytically outside its 
circle of convergence, may be developed asymptotically in the following form, 
it being understood throughout that $0 < \arg z < 2\pi$:


(5) + ()/et+ +: 


where $C_1(k,\alpha,\beta,\gamma)$ and $C_2(k,\alpha,\beta,\gamma)$ are defined in (24) and (21)
respectively. The asymptotic expansion of the solution $Y_1$ of (1) results from this
when the value $k = k_1 = 0$ is used, while the corresponding expansion for $Y_2$
results likewise from the use of $k_2 = 1 - \gamma$.

We are concerned here, however, with the expression of the function $Y_1$
as a linear combination of the solutions $Y_5$ and $Y_6$, that is, we wish to
determine constants $K_1$ and $K_2$ such that


$Y_1 \sim K_1 Y_5 + K_2 Y_6$


with a similar expression for Y,. Upon referring to the definitions (11) of 
Y, and Yz, we can identify by (25) the constant K, with — C;(k, a, B,y). 
The solution Y; contains the factor 2*—z*", We know from the Fuchs 
theory that the non-logarithmic part of the solution Y, does not involve z to 
this power since hn 0. We may therefore identify the constant K, with the 
coefficient of z-8-" in the non-logarithmic part of the right side of (25). This 
coefficient, before taking out the factor C.(k,a,B,y), is the value of the 
coefficient of 2": = 2-** in the non-logarithmic part of the residue (23). This 
is easily found, by use of (22), to be 


(26) Ci(k, a, B, Y) 
+) [p(a+k) +w(y—a+k) +26 + Jem 


We thus arrive at the following result for the solution $Y_1$:
The solution $Y_1$, when extended analytically outside its circle of convergence,
may be developed asymptotically in the following form, it being understood
throughout that $0 < \arg z < 2\pi$:

(27) $Y_1 \sim C_n(k,\alpha,\beta,\gamma)\,Y_5 - C_1(k,\alpha,\beta,\gamma)\,Y_6$,

where $C_n(k,\alpha,\beta,\gamma)$ and $C_1(k,\alpha,\beta,\gamma)$ are defined in (26) and (24)
respectively and in which $k$ takes the value $k_1 = 0$.
The corresponding result expressing the solution $Y_2$ linearly in terms of
$Y_5$ and $Y_6$ is obtained from the foregoing by the use of $k = k_2 = 1 - \gamma$ for $k$.
We note here that the constants $C_1(k,\alpha,\beta,\gamma)$, $C_2(k,\alpha,\beta,\gamma)$ and




$C_n(k,\alpha,\beta,\gamma)$ all preserve meanings regardless of whether $r_1 = -\alpha-k$ or
$r_2 = -\beta-k$ is zero or a positive integer, so that the restriction made in
(15) may now be removed. Since the series $Y_5$ and $Y_6$ are known to be
convergent for the indicated values of $z$, the symbol $\sim$ may be changed to $=$.
We have now only to introduce into (27) the values of the constants
$C_n(k,\alpha,\beta,\gamma)$ and $C_1(k,\alpha,\beta,\gamma)$ as defined in order to arrive at the final
results stated in the Theorem.


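5. Further results. The solutions of (1) about the point $z = 1$ are


$Y_3 = F(\alpha, \beta, \alpha+\beta-\gamma+1;\ 1-z)$,
$Y_4 = (1-z)^{\gamma-\alpha-\beta}\,F(\gamma-\alpha, \gamma-\beta, \gamma-\alpha-\beta+1;\ 1-z)$.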


In order to determine the linear relationships between the solutions $Y_3$
and $Y_4$ and the solutions $Y_5$ and $Y_6$ we first make the transformation $z = 1 - z'$
in (1) and subsequently apply the same methods to this transformed equation
as we have employed in the determination of the results in Theorem I. After
these results have been obtained, we make the reverse transformation $z' = 1 - z$.
The final result obtained in this manner is stated in the following theorem:


THEOREM II. The solutions $Y_3$ and $Y_4$, when extended analytically
outside their circle of convergence, may be expressed linearly in terms of
$Y_5$ and $Y_6$ in the following forms, it being understood throughout that


$0 < \arg(1 - z) < 2\pi$:


+y(1—a) + + in—¥(1/s) Jerre 


+1) 


° 

When the fundamental solutions about either the singular point $z = 0$
or the singular point $z = 1$ are logarithmic the resulting interrelations are
obtained by first making the appropriate transformation of the independent
variable in (1), employing the same procedure as we have used above in the
proof of Theorem I, and then making the reverse transformation in the results.




ARIZONA STATE TEACHERS 
COLLEGE AT FLAGSTAFF. 




THE THEOREMS OF GAUSS-BONNET AND STOKES.* 


By E. R. van KAMPEN. 


The idea of a parallel displacement in vector analysis opened new possibilities
also for the differential geometry of surfaces. The present note contains
an elementary proof of the Gauss-Bonnet theorem based on this idea. A
systematic use is made of the correspondence between the surface and the spaces
of its parameter systems for purposes of subdivision in § 7 and for the
determination of the variation of an angle in § 4.

Third continuous derivatives of the parametric equations of the surface
occur only at the beginning of § 5. Their use may be eliminated by the short
additional consideration in § 6. Thus one obtains also for the Theorema
Egregium (without the explicit expression for the Gaussian curvature) a proof for
surfaces of class $C_2$. It may be of interest that the proof below makes no use
either of the second or of the first fundamental form.

In § 10 a short proof of Stokes' theorem is given which is valid for a vector
field of class $C_1$ and a compact region T on a surface of class $C_1$ if the boundary
of T consists of rectifiable arcs.


1. Let the letters $x, u, v, w, \ldots$ represent vectors in 3-space and let $u\cdot v$,
$u\times v$, $(uvw) = u\cdot(v\times w)$ represent the scalar product, vector product and
triple product of vectors respectively. Let a surface S be given by a parametric
representation of the form

(1) $x = x(u^1, u^2)$

defined in the whole plane of the scalar parameters $u^1$, $u^2$. The letters R and T
will be used to designate curves and regions of S and at the same time the
corresponding curves and regions of the $u^i$-plane. The label $i$ will have the
range 1, 2 and any term containing such a label both as a subscript and as a
superscript must be summed over this range. It will be supposed that (1) is
at least of class $C_1$ and that

(2) $x_1 \times x_2 \neq 0$,


where the subscripts 1 and 2 represent partial differentiation with respect to 


* Received November 15, 1937. 




$u^1$ and $u^2$ respectively. The normal vector of S is defined as the vector (2)
divided by its length and will be denoted by $x_3$. Thus one has


(3) $(x_1 x_2 x_3) > 0$ and $x_3\,(x_1 x_2 x_3) = x_1 \times x_2$.


It is well known that the area of a region T of S may be represented in the form


(4) $\iint_T (x_1 x_2 x_3)\,du^1 du^2 = \iint_T d\sigma.$


Similarly, if


(5) $x = x_3(u^1, u^2)$


represents, in case of a surface S of class $C_2$, the spherical image of S, one
obtains for the area of the spherical image of a region T of S


(6) $\iint_T (x_{31} x_{32} x_3)\,du^1 du^2,$


where the usual conventions have to be used if the correspondence between T
and its image on the sphere is not one-to-one. One may define the Gaussian
curvature K of S at a point P of S as the limit of the quotient of the area
of the spherical image of T by the area of T, when T is a variable part of S of
simple form (e.g. a $u^i$-square) which has P as limit. One finds, from (4) and (6),


(7) $(x_{31} x_{32} x_3) = K\,(x_1 x_2 x_3).$


This well-known formula may be taken immediately from the usual definition
of K by means of the Weingarten differentiation formulae.
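
Formula (7), $K = (x_{31} x_{32} x_3)/(x_1 x_2 x_3)$, lends itself to a direct numerical check. The following is a minimal sketch, assuming a parametrization given as a Python function and approximating all partial derivatives by central differences; the helper names and step sizes are illustrative choices, not part of the paper.

```python
import math

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1], a[2]*b[0] - a[0]*b[2], a[0]*b[1] - a[1]*b[0])

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def triple(a, b, c):
    """Triple product (abc) = a . (b x c)."""
    return dot(a, cross(b, c))

def partial(f, u1, u2, i, h=1e-5):
    """Central difference of a vector-valued f with respect to u^i (i = 1 or 2)."""
    if i == 1:
        plus, minus = f(u1 + h, u2), f(u1 - h, u2)
    else:
        plus, minus = f(u1, u2 + h), f(u1, u2 - h)
    return tuple((p - m) / (2 * h) for p, m in zip(plus, minus))

def unit_normal(x, u1, u2):
    """x_3: the vector x_1 x x_2 divided by its length."""
    n = cross(partial(x, u1, u2, 1), partial(x, u1, u2, 2))
    length = math.sqrt(dot(n, n))
    return tuple(c / length for c in n)

def gaussian_curvature(x, u1, u2):
    """K from (7): (x_31 x_32 x_3) / (x_1 x_2 x_3)."""
    x1, x2 = partial(x, u1, u2, 1), partial(x, u1, u2, 2)
    x3 = unit_normal(x, u1, u2)
    normal = lambda a, b: unit_normal(x, a, b)
    x31 = partial(normal, u1, u2, 1, h=1e-4)
    x32 = partial(normal, u1, u2, 2, h=1e-4)
    return triple(x31, x32, x3) / triple(x1, x2, x3)

# Example surfaces (assumptions for illustration): a sphere of radius 2 and a cylinder.
sphere = lambda u1, u2: (2 * math.cos(u1) * math.cos(u2),
                         2 * math.sin(u1) * math.cos(u2),
                         2 * math.sin(u2))
cylinder = lambda u1, u2: (math.cos(u1), math.sin(u1), u2)
```

For the sphere of radius 2 the computed value is close to 1/4, and for the cylinder it is close to 0. Neither fundamental form is evaluated, in keeping with the approach of the text.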


2. Let an arc R of class $C_2$ on a surface S of class $C_2$ be represented in
the form

(8) $u^i = u^i(s)$,


where s is the arc length of R. On substituting (8) in (1) and differentiating,
one obtains


(9) $\dot x = x_i\,\dot u^i$,


where the dot denotes differentiation with respect to s and the tangent vector $\dot x$
of R has length 1 since the arc length is the parameter on R.

It is clear that there is a one-to-one correspondence between vectors in the
$u^i$-plane which are attached at a certain point and vectors tangent to S at the




corresponding point of S and that this correspondence is continuous both ways.
For instance to (9) there corresponds the vector $\dot u^i$ attached at the point (8).
A system

(10) $v = v(s)$


of vectors which is of class $C_1$ and is tangent to S at the point (8) of S is
called parallel along R if


(11) $\dot v = \dot v(s)$ is normal to S at (8).


On placing $v = v^i x_i$ and $\dot v\cdot x_i = 0$ one obtains linear differential equations
for the $v^i(s)$, so that a system (10) exists and is uniquely determined if the
$v^i(s)$ are given for one value of s. Since $\dot v$ is perpendicular to $v$, the length
of $v$ is independent of s. If (10) is parallel along R, then so is any vector
obtained from (10) by rotation of (10) in the tangent plane of S at (8)
over an angle which is independent of s. For if (10) is parallel, so is
$w = x_3\times v$, which is obtained from $v$ by a rotation over the angle $\tfrac12\pi$; in fact,
$\dot w = \dot x_3\times v + x_3\times\dot v$ is normal to S at (8), since $\dot x_3$ is tangent and $\dot v$ is
normal. Furthermore if any two vectors are parallel along R, so are their
linear combinations with constant coefficients.
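
Parallel displacement in the sense of (11) can be illustrated numerically. The following is a minimal sketch, assuming the unit sphere with outward normal $x_3 = x$ and a circle of colatitude `phi0` as the curve R; these choices and the Runge-Kutta integrator are assumptions made for the example. On the unit sphere the condition that $\dot v$ be normal, together with $v\cdot x_3 = 0$, gives $\dot v = -(v\cdot\dot x)\,x$, which is the equation integrated below.

```python
import math

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1], a[2]*b[0] - a[0]*b[2], a[0]*b[1] - a[1]*b[0])

def latitude_circle(phi0):
    """Arc-length parametrization of the circle of colatitude phi0 on the unit sphere."""
    r = math.sin(phi0)
    x = lambda s: (r * math.cos(s / r), r * math.sin(s / r), math.cos(phi0))
    xdot = lambda s: (-math.sin(s / r), math.cos(s / r), 0.0)
    return x, xdot, 2 * math.pi * r

def transport_once_around(phi0, steps=2000):
    """Transport v(0) = xdot(0) once around the circle; return v and (cos, sin) of the
    angle from v to the tangent vector at the end point."""
    x, xdot, length = latitude_circle(phi0)

    def rhs(s, v):
        c = -dot(v, xdot(s))               # v' = -(v . x') x, a normal vector
        return tuple(c * xi for xi in x(s))

    def shifted(v, k, h):
        return tuple(vi + h * ki for vi, ki in zip(v, k))

    v, s, h = xdot(0.0), 0.0, length / steps
    for _ in range(steps):                 # classical fourth-order Runge-Kutta step
        k1 = rhs(s, v)
        k2 = rhs(s + h / 2, shifted(v, k1, h / 2))
        k3 = rhs(s + h / 2, shifted(v, k2, h / 2))
        k4 = rhs(s + h, shifted(v, k3, h))
        v = tuple(vi + h / 6 * (a + 2 * b + 2 * c + d)
                  for vi, a, b, c, d in zip(v, k1, k2, k3, k4))
        s += h
    cos_theta = dot(v, xdot(length))
    sin_theta = dot(v, cross(xdot(length), x(length)))   # triple product (v x' x_3)
    return v, cos_theta, sin_theta
```

The length of $v$ stays equal to 1, and the angle from $v$ to the tangent vector after one circuit equals $2\pi\cos\phi_0$ modulo $2\pi$, which is the integral of the geodesic curvature $\cot\phi_0$ over the circle of length $2\pi\sin\phi_0$.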

It follows that, if $\theta$ is the angle from (10) to (9), then $\dot\theta$ is independent
of the particular system (10) chosen. The derivative $\dot\theta$ is called the geodesic
curvature of R and will be denoted by $\kappa$. One finds, if $v$ has length 1,


(12) $\sin\theta = (v\,\dot x\,x_3)$, $\cos\theta = v\cdot\dot x$,
hence
$\dot\theta\cos\theta = (\dot v\,\dot x\,x_3) + (v\,\ddot x\,x_3) + (v\,\dot x\,\dot x_3)$.


Here the first and third triple products are 0, since $\dot v$ is normal and $\dot x_3$ is
tangent to S. On choosing $v$ to be $\dot x$ at the point of R under consideration,
one finds $\cos\theta = 1$, hence


(13) $\kappa = \dot\theta = (\dot x\,\ddot x\,x_3)$.


3. Let a vector $w = w(s)$ be tangent to S at the point (8), let $w$ be of
class $C_1$ and of length 1 and let the angles from $w$ to $\dot x$ and to $v$ be denoted
by $\omega$ and $\phi$. Then one has


(14) $\omega = \theta + \phi$,


if care is taken that all three angles are continuous and that (14) holds exactly
(and not only modulo $2\pi$) at some point of R. Furthermore


(15) $\sin\phi = (w\,v\,x_3)$, $\cos\phi = w\cdot v$,

and so, in the same way as (13) from (12),

(16) $\dot\phi = (\dot w\,w\,x_3)$.


Now let an arc R be divided by a finite number of vertices $P_k$ into arcs
of class $C_2$. Denote by $\alpha_k$ the angle from $\dot x(s_k - 0)$ to $\dot x(s_k + 0)$, where
$-\pi \le \alpha_k \le \pi$ and $s_k$ is the value of s at $P_k$. If $\alpha_k = \pm\pi$ choose the
signature of $\alpha_k$ in such a way that at every $P_k$ the angle subtended by the region
to the right of R is $\alpha_k + \pi$.

Let $w$ be continuous at $P_k$ and of class $C_1$ on the remainder of R. Clearly
a parallel system $v$ may be defined along R having the same properties. If the
convention is used that


(17) $\theta(s_k + 0) - \theta(s_k - 0) = \omega(s_k + 0) - \omega(s_k - 0) = \alpha_k,$


while $\phi$ is continuous for every s, then (14) remains true for the arc R. If
$\Delta\omega$, $\Delta\theta$, $\Delta\phi$ denote the increase of $\omega$, $\theta$, $\phi$ while s increases from its
initial value to its final value, one has, by (14) and (17),


(18) $\Delta\omega = \Delta\theta + \Delta\phi$


and one obtains from (13) and (17) and from (16)


(19) $\Delta\theta = \int_R \kappa\,ds + \sum_k \alpha_k$ and $\Delta\phi = \int_R (\dot w\,w\,x_3)\,ds,$


hence, from (18),

(20) $\Delta\omega = \int_R \kappa\,ds + \int_R (\dot w\,w\,x_3)\,ds + \sum_k \alpha_k.$


4. The meaning of (20) is not lost if R is a simple closed curve and the
parameter s begins and ends at a point of R where the second derivative exists.
It will be assumed for the present that R is a simple closed curve which is the
positively oriented boundary of a compact portion T of S. In that case it is
easy to define a vector field $w$ which is of class $C_1$ on T, hence continuous on R
and of class $C_1$ on each arc of class $C_2$ of R. One can for instance transfer a
suitable field from the $u^i$-plane to S. It will be shown that in the case
considered

(21) $\Delta\omega = 2\pi$.


First, $\Delta\omega = \Delta\lambda$, where $\lambda$ is the angle in the $u^i$-plane, from the vector $w^i$
which corresponds to $w$ to the vector $\dot u^i$ which corresponds to $\dot x$. This is clear,




since both $\Delta\omega$ and $\Delta\lambda$ are integer multiples of $2\pi$, while, for increasing s, the
angles $\omega$ and $\lambda$ both pass at the same time and in the same direction through
a multiple of $\pi$.

Next,¹ $\Delta\lambda = \Delta\mu$, where $\mu$ is the inclination of $\dot u^i$, i.e. the angle from the
vector (1, 0) to the vector $(\dot u^1, \dot u^2)$. Note that T, which is compact by
assumption, is represented in the $u^i$-plane by the simple closed curve R together
with its interior. Now on this set the inclination of $w^i$ may be determined
as a continuous function.² Hence $w^i$ may be changed into the constant vector
field (1, 0) by a continuous rotation. This proves $\Delta\lambda = \Delta\mu$, since both $\Delta\lambda$ and
$\Delta\mu$ are integral multiples of $2\pi$.

In order to prove (21) it remains to show that $\Delta\mu = 2\pi$. Now this is
the "Umlaufsatz" of which simple proofs may be found in the literature.³

As a consequence of (21), (20) takes the form


(22) $\int_R \kappa\,ds + \int_R (\dot w\,w\,x_3)\,ds = 2\pi - \sum_k \alpha_k.$


5. Let it be assumed for a moment that S is of class $C_3$, so that $w$ may
be chosen of class $C_2$. Then the theorem of Green may be applied to the
second integral in (22) as follows,


(23) $\int_R (\dot w\,w\,x_3)\,ds = \int_R [(w_1 w x_3)\,du^1 + (w_2 w x_3)\,du^2]
= \iint_T [(w_2 w x_3)_1 - (w_1 w x_3)_2]\,du^1 du^2.$


Now one has for the integrand of the last integral,


(24) $(w_2 w x_3)_1 - (w_1 w x_3)_2 = (w_{21} w x_3) + (w_2 w_1 x_3) + (w_2 w x_{31})
- (w_{12} w x_3) - (w_1 w_2 x_3) - (w_1 w x_{32}).$


¹ It is clear that this step in the argument could be avoided by choosing $w$ in a
suitable way to begin with.

² It is possible to subdivide T into arbitrarily small polygons with edges of class $C_2$.
One may for instance use line segments in the $u^i$-plane. If this inclination could not be
defined as a continuous function on T, its variation along the boundary of at least one of
these polygons would be $2\pi n \neq 0$. But this is clearly impossible if the polygons are
sufficiently small.

³ H. Hopf, Compositio Mathematica, vol. 2 (1935), p. 50 and pp. 53-55, where
further references are given; also E. R. van Kampen, Compositio Mathematica, vol. 4
(1937), p. 272.




Here the first and fourth terms on the right cancel, while the second and fifth
are both zero, since they contain only vectors perpendicular to $w$. Hence


(25) $\int_R (\dot w\,w\,x_3)\,ds = \iint_T [(w_2 w x_{31}) - (w_1 w x_{32})]\,du^1 du^2.$


It remains to evaluate the integrand of the last integral. Now $w$ and $x_{31}$ are
tangent to S, so that in $(w_2 w x_{31})$, $w_2$ may be replaced by its normal component,
which is equal to $(w_2\cdot x_3)\,x_3 = -(w\cdot x_{32})\,x_3$ since $w\cdot x_3 = 0$. Thus
one finds for the integrand on the right of (25),


(26) $(w\cdot x_{31})(w\,x_{32}\,x_3) - (w\cdot x_{32})(w\,x_{31}\,x_3).$


Applying the identity $a\times(b\times c) = (a\cdot c)\,b - (a\cdot b)\,c$ in two different
ways to $a\cdot((b\times c)\times(d\times e))$ one obtains


$a\cdot b\,(cde) - a\cdot c\,(deb) + a\cdot d\,(ebc) - a\cdot e\,(bcd) = 0.$


Substituting here $w$, $w$, $x_3$, $x_{32}$, $x_{31}$ instead of $a$, $b$, $c$, $d$, $e$, one finds
that (26) equals $(x_{31} x_{32} x_3)$. Thus (26) is $K\,(x_1 x_2 x_3)$ by (7), and (4) shows
that (25) takes the form


(27) $\int_R (\dot w\,w\,x_3)\,ds = \iint_T K\,d\sigma,$


so that the theorem of Gauss-Bonnet (in the simplest case) is obtained from
(22) in the form:


(28) $\int_R \kappa\,ds + \iint_T K\,d\sigma = 2\pi - \sum_k \alpha_k.$
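
The identity $a\cdot b\,(cde) - a\cdot c\,(deb) + a\cdot d\,(ebc) - a\cdot e\,(bcd) = 0$ for five vectors can be checked on examples. The following is a minimal sketch using integer vectors, so that the arithmetic is exact; the random test vectors are of course an assumption of the example and no substitute for the derivation.

```python
import random

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1], a[2]*b[0] - a[0]*b[2], a[0]*b[1] - a[1]*b[0])

def triple(a, b, c):
    """Triple product (abc) = a . (b x c)."""
    return dot(a, cross(b, c))

def five_vector_expression(a, b, c, d, e):
    """Left side of a.b(cde) - a.c(deb) + a.d(ebc) - a.e(bcd) = 0."""
    return (dot(a, b) * triple(c, d, e) - dot(a, c) * triple(d, e, b)
            + dot(a, d) * triple(e, b, c) - dot(a, e) * triple(b, c, d))

rng = random.Random(0)
samples = [[tuple(rng.randint(-9, 9) for _ in range(3)) for _ in range(5)]
           for _ in range(100)]
values = [five_vector_expression(*vectors) for vectors in samples]
```

Every entry of `values` is exactly zero, as the identity requires.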


The formula

$K\,(x_1 x_2 x_3) = (w_2 w x_3)_1 - (w_1 w x_3)_2,$

which is implicit in (23) and (27), goes over into well-known formulae if one
chooses $w$ to be parallel either to $x_1$ or to $x_2$. It is obvious that the identity
of (26) and $(x_{31} x_{32} x_3)$ may be obtained from the rule for $\sin(\alpha - \beta)$, where
$\alpha$ and $\beta$ are the angles from $x_{31}$ and $x_{32}$ to $w$.
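
The simplest case of the Gauss-Bonnet theorem can be tested on a geodesic triangle of the unit sphere, where $\kappa = 0$ along the sides and $K = 1$, so that the area and the three angles $\alpha_k$ at the vertices must add up to $2\pi$. The following is a minimal sketch under these assumptions. The area is computed independently of the angles by the Van Oosterom-Strackee solid-angle formula, which is brought in for the purpose of the check and does not occur in the text.

```python
import math

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1], a[2]*b[0] - a[0]*b[2], a[0]*b[1] - a[1]*b[0])

def unit(a):
    n = math.sqrt(dot(a, a))
    return tuple(c / n for c in a)

def exterior_angle(p, q, r):
    """Angle from the incoming to the outgoing tangent at q on the path p -> q -> r."""
    t_in = unit(tuple(dot(p, q) * qi - pc for pc, qi in zip(p, q)))
    t_out = unit(tuple(rc - dot(r, q) * qi for rc, qi in zip(r, q)))
    return math.atan2(dot(cross(t_in, t_out), q), dot(t_in, t_out))

def spherical_area(a, b, c):
    """Area of the geodesic triangle abc on the unit sphere (solid-angle formula)."""
    return 2 * math.atan2(dot(a, cross(b, c)),
                          1 + dot(a, b) + dot(b, c) + dot(c, a))

def area_plus_angles(a, b, c):
    """Integral of K over the triangle plus the sum of the angles alpha_k."""
    angles = (exterior_angle(c, a, b) + exterior_angle(a, b, c)
              + exterior_angle(b, c, a))
    return spherical_area(a, b, c) + angles

octant = ((1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0))
skew = ((1.0, 0.0, 0.0), (0.0, 1.0, 0.0), unit((1.0, 1.0, 1.0)))
```

For the octant the area is $\pi/2$ and each angle $\alpha_k$ is $\pi/2$; for the second triangle the area is $\pi/6$. In both cases the total is $2\pi$, the vertices being listed in the positive sense with respect to the outward normal.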


6. Now it will be shown that (25) holds for a vector field $w$ of class $C_1$.
If $a = a(u^1, u^2)$ and $b = b(u^1, u^2)$ are scalar functions on T of class $C_1$, then


(29) $\int_R a\,\dot b\,ds = \int_R (a b_1\,du^1 + a b_2\,du^2) = \iint_T (a_1 b_2 - a_2 b_1)\,du^1 du^2.$




This is a consequence of the theorem of Green, since $b$, $b_1$ and $b_2$ may be
approximated uniformly on T by a polynomial in $u^1$ and $u^2$ and its two partial
derivatives. On replacing $a$ and $b$ in (29) by corresponding components of
$w\times x_3$ and of $w$ and adding the results, one may verify that


$\int_R (w\times x_3)\cdot\dot w\,ds = \iint_T [(w\times x_3)_1\cdot w_2 - (w\times x_3)_2\cdot w_1]\,du^1 du^2,$


which identity is equivalent to (25), since $(w_1 w_2 x_3) = 0$.
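
Formula (29), $\int_R a\,\dot b\,ds = \iint_T (a_1 b_2 - a_2 b_1)\,du^1 du^2$, can be tried out on the unit square of the parameter plane. The following is a minimal sketch, assuming the region T is the square $0 \le u^1, u^2 \le 1$ with its boundary traversed in the positive sense, and using midpoint sums and central differences; the functions `a` and `b` are hypothetical examples.

```python
def d1(f, u1, u2, h=1e-5):
    """Central difference with respect to u^1."""
    return (f(u1 + h, u2) - f(u1 - h, u2)) / (2 * h)

def d2(f, u1, u2, h=1e-5):
    """Central difference with respect to u^2."""
    return (f(u1, u2 + h) - f(u1, u2 - h)) / (2 * h)

def boundary_integral(a, b, n=400):
    """Integral of a*(b_1 du^1 + b_2 du^2) around the unit square, counterclockwise."""
    h, total = 1.0 / n, 0.0
    for j in range(n):
        t = (j + 0.5) * h
        total += a(t, 0.0) * d1(b, t, 0.0) * h    # bottom side, u^1 increasing
        total += a(1.0, t) * d2(b, 1.0, t) * h    # right side, u^2 increasing
        total -= a(t, 1.0) * d1(b, t, 1.0) * h    # top side, u^1 decreasing
        total -= a(0.0, t) * d2(b, 0.0, t) * h    # left side, u^2 decreasing
    return total

def area_integral(a, b, n=200):
    """Integral of a_1*b_2 - a_2*b_1 over the unit square (midpoint rule)."""
    h, total = 1.0 / n, 0.0
    for i in range(n):
        for j in range(n):
            u1, u2 = (i + 0.5) * h, (j + 0.5) * h
            total += (d1(a, u1, u2) * d2(b, u1, u2)
                      - d2(a, u1, u2) * d1(b, u1, u2)) * h * h
    return total

# Hypothetical example functions.
a = lambda u1, u2: u1 * u2
b = lambda u1, u2: u1 * u1 + u2

left = boundary_integral(a, b)
right = area_integral(a, b)
```

For these functions $a_1 b_2 - a_2 b_1 = u^2 - 2(u^1)^2$, whose integral over the square is $-1/6$; both sums agree with this value up to the discretization error.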

Since a vector field $w$ of class $C_1$ always exists on T, provided that S is
of class $C_2$, it follows that (28) holds if S is of class $C_2$, while R consists of
arcs of class $C_2$.

Since, as is well known, on a surface of class $C_2$ the geodesic curvature
of an arc R depends on the first fundamental form of S only, and since (28)
holds for every T, it follows that the Gaussian curvature of a surface S of
class $C_2$ does not depend on the second fundamental form of S. Hence the
Theorema Egregium holds for surfaces of class $C_2$.


7. It is, of course, not possible to represent every surface S by one
representation (1). However, any surface S may be defined by means of a finite
or enumerable collection of representations (1), with the understanding that
whenever a part of S is defined by two representations (1), there exists an
orientation preserving continuous one-to-one correspondence between the two
sets of parameters involved. The parameter representations (1) and the above
mentioned correspondence are all supposed to be of the same class $C_n$, which
is also the class of the surface S. Now, any compact subset T of S, which has
as boundary R a graph consisting of arcs of class $C_n$, may be subdivided by
additional arcs of class $C_n$ into a finite number of regions $T_m$, $m = 1, 2, \ldots$,
such that the boundary $R_m$ of $T_m$ is a simple closed curve and that $T_m$ is defined
by means of only one of the representations (1) of S. The proof may be
sketched as follows. First, as a consequence of the compactness of T, this set
is contained in the part of S defined by means of a finite number of
representations (1). Next, the part of T defined by a fixed representation (1) is
a subset of the corresponding $u^i$-plane of which the boundary consists of certain
arcs of class $C_n$. The part of T which is not defined by any other
representation (1) and not yet subdivided in the desired way is bounded in this $u^i$-plane.
Clearly a somewhat larger part of T may be subdivided in the desired way,
for instance by means of line segments. On repeating this for the finite
number of representations (1) which are used to define T, one clearly obtains
a proof of the above statement.




If the numbers of vertices, edges and regions $T_m$ which occur in the above
subdivision are $a_0$, $a_1$, $a_2$, one terms $C = a_0 - a_1 + a_2$ the characteristic of T.
In case $n = 2$, the independence of C from the particular subdivision is an
automatic consequence of the Gauss-Bonnet theorem.
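
The characteristic $C = a_0 - a_1 + a_2$ is easily computed from a combinatorial description of a subdivision. The following is a minimal sketch, assuming each region is given as the cyclic list of its vertex labels and that two regions share an edge exactly when they share a pair of consecutive labels; this encoding is an assumption of the example.

```python
def characteristic(regions):
    """Return a_0 - a_1 + a_2 for a subdivision given as cyclic vertex lists."""
    vertices = {v for region in regions for v in region}
    edges = set()
    for region in regions:
        for i, v in enumerate(region):
            w = region[(i + 1) % len(region)]
            edges.add(frozenset((v, w)))       # an edge is an unordered pair
    return len(vertices) - len(edges) + len(regions)

# Two subdivisions of a square and one of a ring-shaped region.
square_two_triangles = [(0, 1, 2), (0, 2, 3)]
square_four_triangles = [(0, 1, 4), (1, 2, 4), (2, 3, 4), (3, 0, 4)]
ring_four_quadrangles = [(0, 1, 5, 4), (1, 2, 6, 5), (2, 3, 7, 6), (3, 0, 4, 7)]
```

Both subdivisions of the square give $C = 1$, in agreement with the independence from the subdivision, while the ring gives $C = 0$.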


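8. Now suppose that a region T on a surface S of class $C_2$ has a boundary
R consisting of arcs of class $C_2$ and that $a_0$, $a_1$, $a_2$; $T_m$, $R_m$, $m = 1, \ldots, a_2$,
have the meaning of § 7. Denote by $\beta_{mk}$ the interior angles of $T_m$, so that, since
(28) holds for each $T_m$,


(30) $\int_{R_m} \kappa\,ds + \iint_{T_m} K\,d\sigma = 2\pi - \sum_k (\pi - \beta_{mk}).$


On taking the sum of (30) for all m one obtains


(31) $\int_R \kappa\,ds + \iint_T K\,d\sigma = \sum_m \bigl[\,2\pi - \sum_k (\pi - \beta_{mk})\bigr].$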

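Let $\beta_k$, $k = 1, 2, \ldots$, denote all interior angles of the complement of T
on S, so that $\alpha_k = \beta_k - \pi$ are the discontinuities in the tangent vector of R
if one follows all boundaries of the complement of T in their negative direction.
The $\alpha_k$ will be called the oriented angles of R. The right side of (31) is


$-\sum_k \alpha_k + 2\pi a_2 - \sum_{m,k} (\pi - \beta_{mk}) - \sum_k (\pi - \beta_k),$


or

(32) $-\sum_k \alpha_k + 2\pi a_2 + \bigl(\sum_{m,k} \beta_{mk} + \sum_k \beta_k\bigr) - \bigl(\sum_{m,k} \pi + \sum_k \pi\bigr).$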


Here the third term represents the sum of all angles subtended at all vertices
of the subdivisions of T used, hence it is equal to $2\pi a_0$. The fourth term
contains the number $\pi$ each time a region $T_m$ (or the complement of T)
adjoins a vertex. Hence it also contains the number $\pi$ each time an edge
adjoins a vertex. Thus it is equal to $2\pi a_1$. On using the definition of the
characteristic C of T, one sees that (31) goes over into


(33) $\int_R \kappa\,ds + \iint_T K\,d\sigma = 2\pi C - \sum_k \alpha_k.$

Thus one may formulate the general


THEOREM OF GAUSS-BONNET. If K and $d\sigma$ are Gaussian curvature and
area element of a surface S of class $C_2$, T is a compact region of S with
characteristic C and of which the boundary R consists of a finite number of arcs of
class $C_2$; if furthermore $\kappa$ is the geodesic curvature and ds the element of
length of the oriented boundary R of T; if finally the oriented angles of R are
denoted by $\alpha_k$, then (33) holds.


It may be remarked that the existence of a field $w$ of constant length and
of class $C_1$ may be proved on T, whenever the boundary R of T is not the
empty set. For such a field the considerations of this section prove the formula
$\Delta\omega = 2\pi C$, where $\omega$ is the angle from $w$ to the tangent vector of R, and $\Delta\omega$ is
the variation of $\omega$ along R. This formula may be considered as the
generalisation of the "Umlaufsatz" for curved surfaces. The proof may of course be
based on weaker regularity assumptions.


10. Stokes' Theorem. The result of § 7 may be used to give a simple
derivation⁴ of Stokes' theorem for a vector field and surface with boundary
all of class $C_1$.

Let S, T and R have the properties of § 7 in case $n = 1$, and let $v$ be a
vector field of class $C_1$ defined on a region U which has S in its interior.⁵
Since Stokes' theorem clearly follows for T if it is proved for the regions $T_m$
defined in § 7, it may be assumed that S is defined in terms of a single
representation (1) and that T is a compact region with a simple closed curve R
as boundary.

Let $\partial$ represent a vectorial differential operator of which the action
extends only to the vector field $v$. Thus one has, for instance,


(34) $\partial\cdot v = \operatorname{div} v$, $\partial\times v = \operatorname{rot} v$,

and on the surface S,


(35) $(x_1\cdot\partial)\,v = v_1$, $(x_2\cdot\partial)\,v = v_2$.

Applying the well-known identity

$(a\times b)\cdot(c\times d) = (a\cdot c)(b\cdot d) - (a\cdot d)(b\cdot c)$

to $x_1$, $x_2$, $\partial$, $v$ instead of $a$, $b$, $c$, $d$, one obtains on S,


$(x_1\times x_2)\cdot(\partial\times v) = (x_1\cdot\partial)(x_2\cdot v) - (x_2\cdot\partial)(x_1\cdot v),$


⁴ A similar proof is given in McShane's translation of Courant's Differential- und
Integralrechnung.

⁵ It would be sufficient for the boundary R of T to consist of rectifiable curves,
since in that case Green may be applied also.


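hence by (34) and (35),

$(x_1\times x_2)\cdot\operatorname{rot} v = v_1\cdot x_2 - v_2\cdot x_1,$

and by (3),

(36) $(\operatorname{rot} v\cdot x_3)\,(x_1 x_2 x_3) = v_1\cdot x_2 - v_2\cdot x_1.$


The validity of this symbolic computation is obvious.
On applying Green's theorem in the form (29) to the different components
of $v$ and $x$ one finds


$\int_R v\cdot\dot x\,ds = \iint_T (v_1\cdot x_2 - v_2\cdot x_1)\,du^1 du^2$


or, by (36) and (4),

(37) $\int_R v\cdot\dot x\,ds = \iint_T \operatorname{rot} v\cdot x_3\,d\sigma.$


On defining $\dot x\,ds = d\mathbf{s}$ and $x_3\,d\sigma = d\boldsymbol{\sigma}$ to be the vectorial elements of length
and area of R and T one obtains the


THEOREM OF STOKES. If T is a compact region of a surface S of class $C_1$
and the boundary of T consists of a finite number of arcs of class $C_1$, if
furthermore $d\mathbf{s}$ and $d\boldsymbol{\sigma}$ are the vectorial elements of length and area of R and T
and $v$ is a vector field of class $C_1$ in a region U which contains S, then


(38) $\int_R v\cdot d\mathbf{s} = \iint_T \operatorname{rot} v\cdot d\boldsymbol{\sigma}.$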

THE JOHNS HOPKINS UNIVERSITY. 




HOMOMORPHISM OF RINGS AND FIELDS OF POINT SETS.* 


By Morris Kline.


1. Introduction. Though systems of sets of points such as covering
systems, systems of sets defining a space, Borel, and analytic sets have been
extensively studied, little attention has been paid to questions involving
properties analogous to group properties of such or more general systems. Recently¹
M. H. Stone has studied the abstract algebraic properties of Boolean rings
and their connection with Boolean algebras, and his work has significance for
systems of point sets which are interpretations of Boolean algebras. The rings
and fields of point sets studied in this paper do constitute interpretations of
generalized Boolean algebras, and the complete fields, a term to be defined
shortly, here studied, constitute an interpretation of the Boolean algebras.
Hence some of the group properties of rings and fields of point sets are already
known. Yet, because the point set systems are special cases, we are permitted
to obtain results for them thus far not obtained for the more general systems
and perhaps of importance only for point sets.

The work of this paper has bearing on existing material in that many
systems of point sets, such as Borel sets and the system of all subsets of a given
set, form complete fields;² in that a ring of point sets has an immediate
interpretation as a ring of functions and conversely;³ and in that the notion
of homomorphism between set systems is useful in topological problems.⁴
Also, the possibilities of combining point set and group properties as in the
study of topological groups are interesting.


2. Rings and fields of sets. This part will deal with the simpler 
algebraic properties of rings and fields of point sets. Not all the results in 
this part are new; in a few instances, as indicated below, the conclusions are 
implied by work on Boolean rings. Nevertheless, a few proofs are given of 


* Received February 10, 1937; revised August 31, 1937. 
¹ M. H. Stone, "The theory of representations for Boolean algebras," Transactions
of the American Mathematical Society, vol. 40 (1936), pp. 37-111.

² C. Kuratowski, Topologie I, p. 22.

³ W. Sierpiński, "Sur les anneaux de fonctions," Fundamenta Mathematicae, vol.
18 (1932), p. 6.

⁴ See, for example, E. Čech, "Théorie générale de l'homologie dans un espace
quelconque," Fundamenta Mathematicae, vol. 19 (1932), pp. 149-183.




old results because they can be obtained so much more easily for systems of
point sets.


DEFINITIONS. The sum of two systems A and B of point sets, indicated
by A + B, shall mean the system of sets obtained by adding each set of A to
each set of B, as point sets; likewise for the product, indicated by A × B,
and the difference, A − B, of two systems. A single set is sometimes regarded
as a system in the use of these operations.


A ring of sets is a system of sets to which the sum and intersection of
any two sets of the system belong.⁵ By a subring we shall understand any
subcollection of sets of the ring which also form a ring.

If R is a ring, by the homomorph of R we shall mean a system of sets R'
satisfying the following conditions:⁶


1) To each set of R there shall correspond one and only one set of R';

2) Each set of R' shall correspond to at least one set of R;

3) If A + B = C or A·B = D, where A and B are any sets of R,
corresponding relations shall hold among the corresponding sets of R'. The relation
in R' corresponding to one in R will be the same as the latter unless otherwise
specified.


If, instead of condition 2), we have that to each set of R' there
corresponds one and only one set of R we shall say that R' is the isomorph of R.

By a residue class of a ring R with respect to a subring S we mean the
system A + S, where A belongs to R. An ideal is a subring such that if set
B belongs to the subring and C is any set of the ring, then B·C belongs to
the subring.

The very definition of homomorphism gives the following two occasionally
useful theorems.


THEOREM I. The homomorph of a ring is a ring.


THEOREM II. All the residue classes of a ring R with respect to an ideal
S form a system of residue classes which is the homomorph of R if we let
sum and intersection in R correspond to sum and product, respectively, of the
residue classes.⁷


THEOREM III. If R is a ring containing a null set,⁸ the system of residue


⁵ F. Hausdorff, Mengenlehre (1935), p. 77.

⁶ B. L. van der Waerden, Moderne Algebra, vol. 1, p. 44.

⁷ Cf. van der Waerden, loc. cit., p. 35.

⁸ The term null set is here used to mean the empty set or a zero set.




classes determined by any ideal S of R is the isomorph of R under the
correspondence of operations of the previous theorem.


Proof. If A and B belong to R, then A + S = B + S if and only if
A = B. For, since R contains a null set and S is an ideal, S contains a null
set. Suppose A + S = B + S. Then A + 0 = B + P where P belongs to
S. Hence $A \supset B$. Likewise B + 0 = A + Q where Q belongs to S. Hence
$B \supset A$, and therefore A = B.

It is now obvious that if we let A correspond to A + S the correspondence
will, in view of the previous theorem, determine an isomorphism.


DEFINITION. By a domain of integrity we shall mean a ring in which
the intersection of any two non-zero elements is a non-zero element.


THEOREM IV. A necessary and sufficient condition that the residue classes
with respect to an ideal S of a ring R containing a null set form a domain of
integrity⁹ is that R be a domain of integrity.


Proof. If A and B belong to R and A·B = 0, then the product of the
corresponding residue classes would have to give S, else the isomorphism just
proven to exist would not hold. Likewise for the converse.


DEFINITION. If the system R' is the homomorph of the system R, then
the sets of R which correspond to the set A' of R' are said to form the class
of R corresponding to A'.¹⁰


THEOREM V. If the set system R' is the homomorph of the set system R
and R' contains a null set, the class $\Sigma$ of R corresponding to this null set, 0',
is an ideal. The other classes are domains of integrity.¹¹


Proof. That $\Sigma$ is an ideal follows as in van der Waerden. If $K_A$ denotes
the system of sets of R corresponding to A' ≠ 0' of R', then certainly if A
and B belong to $K_A$, A + B and A·B belong. Moreover, under any
homomorphism of two rings, if R contains a null set it must correspond to 0' of R';
for, if M be any set of R, since M·0 = 0, M'·A' = A', where 0 corresponds
to A'. But M' can be any set of R'; hence A' is included in every set of R'
including the null set 0'. Hence A' = 0'. In view of this fact, if A and B
belong to $K_A$, A·B ≠ 0, else A'·A' = 0'. Hence $K_A$ is a domain of integrity.


DEFINITIONS. A field of sets is a system of sets to which the sum and 


⁹ The zero residue class is S, itself.
¹⁰ Cf. van der Waerden, loc. cit., p. 32.
¹¹ Cf. van der Waerden, loc. cit., p. 56.




difference of any two sets of the system belong.¹² A subfield shall mean any
subcollection of sets of the field which is itself a field. By a complete field
we shall mean a field to which the sum of all the sets of the field belongs.


If A and B are any two sets, A·B = A − [(A + B) − B]; hence a field
is a ring. Hence by the homomorph of a field we shall understand a system
of sets satisfying the conditions of a homomorph of a ring. Similarly for
isomorphism.

The following result is not new but can be obtained at once for systems
of point sets.¹³


THEOREM VI. Every field of finite order (i.e., field containing a finite
number of sets) consists exclusively of sets formed by the process of addition
applied to a subcollection of mutually exclusive sets of the field.


Let the mutually exclusive sets be lettered $B_1, B_2, \ldots, B_n$. Then every
set of the field is easily shown to be expressible in the form
$A = \sum_{j=1}^{n} \delta_j B_j$, where each $\delta_j$ is 0 or 1. From this statement we have


COROLLARY I. Every field of finite order contains $2^n$ sets.¹⁴


COROLLARY II. The order of a subfield is a divisor of the order of the
field.
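
Theorem VI and its corollaries can be observed on a small example. The following is a minimal sketch, assuming finite point sets represented as Python frozensets, with sum and difference taken as union and set difference; the generating sets are hypothetical. The field generated by a few sets is formed by closing under sum and difference, and the mutually exclusive sets $B_j$ are then found as the minimal non-empty members.

```python
def generate_field(generators):
    """Smallest system containing the generators and closed under sum and difference."""
    field = {frozenset(g) for g in generators}
    changed = True
    while changed:
        changed = False
        current = list(field)
        for a in current:
            for b in current:
                for c in (a | b, a - b):
                    if c not in field:
                        field.add(c)
                        changed = True
    return field

def atoms(field):
    """Minimal non-empty sets of the field; they are mutually exclusive."""
    nonempty = [s for s in field if s]
    return [s for s in nonempty if not any(t < s for t in nonempty)]

def union_of_atoms_inside(s, atom_list):
    """Sum of those atoms which are contained in s."""
    result = frozenset()
    for t in atom_list:
        if t <= s:
            result |= t
    return result

# Hypothetical generating sets.
field = generate_field([{1, 2, 3}, {3, 4}, {5}])
basis = atoms(field)
subfield = generate_field([{1, 2, 3}])
```

Here the atoms are {1, 2}, {3}, {4}, {5}, the field has $2^4 = 16$ sets, each of which is the sum of the atoms it contains, and the subfield generated by {1, 2, 3} has order 2, a divisor of 16.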


THEOREM VII. Let the system F' of sets be the homomorph of the field
F. Then there exists a field F'' of sets formed from the sets of F', which is
the homomorph of F. Moreover, the correspondence between F and F'' holds
under the operation of difference.


Proof. Let the null set of F correspond to the set D' of F'. If A is any
set of F, since A·0 = 0, A'·D' = D'; and D' is therefore contained in every
set of F'.

Let the system F'' consist of all sets of the form A' − D', where A' is any
set of F', and let A of F correspond to A' − D' of F''. If the correspondence
between F and F'' holds under sum, intersection, and difference it will follow
that F'' is a field.


¹² Hausdorff, loc. cit., p. 78. Hausdorff uses difference only when the subtrahend is
a subset of the minuend. We do not. This distinction is unimportant for fields because,
if A and B are any two sets, A − B = (A + B) − B and hence under either use of the
term the same sets belong to a field.

¹³ Cf. theorems 4 and 12 in the paper of Stone's referred to above.

¹⁴ This result is obtained by B. A. Bernstein, "On finite Boolean algebras,"
American Journal of Mathematics, vol. 57 (1935), p. 742.




That the correspondence holds under sum and intersection is immediate. 
To show that it holds under difference we have but to employ the fact that 
A—B=C implies A=>A-B+C and (A-B)-C=0. 


We should notice that if F' contains a null set, 0', then D' = 0' and
F' = F''. Moreover, if F is a complete field with S as the sum of its sets,¹⁵
then if S' of F' is the correspondent of S, S' must be the sum of the sets of F'.
This follows because if A is any set of F we have $S \cdot A = A$; then
$S' \cdot A' = A'$. These facts give the


COROLLARY. If the set system F' is the homomorph of the field (complete
field) F, where F' contains a null set, then F' is a field (complete field).
Moreover, the correspondence between F and F' holds under the operation of
difference.¹⁶


It is easy to prove that 


THEOREM VIII. The homomorph F' of a complete field F, under a
correspondence of sum and intersection in F with intersection and sum,
respectively, in F', is a complete field provided F' contains a null set.


THEOREM IX. If F' is a set system which is the homomorph of the field
F and if F' contains a null set, then the class $\mathfrak{N}$ of F which
corresponds to 0' is a field, and the sets of any other class are expressible in
terms of the sum and difference of the sets of $\mathfrak{N}$ and some one member
of that class.¹⁷


Proof. If A and B of F correspond to 0' of F', then A + B and A - B
correspond to 0' because theorem VII assures us that the correspondence holds
under difference.

Let A of F correspond to A' ≠ 0' of F'. Suppose B of F corresponds to A'.
We shall show that B is expressible as the sum and difference of A and the sets
of $\mathfrak{N}$. Let A - B = X. Since A - B corresponds to A' - A' = 0', A - B
belongs to $\mathfrak{N}$. Likewise B - A = Y belongs to $\mathfrak{N}$. Finally,
B = A - X + Y.
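
The decomposition B = A - X + Y can be tried on a small homomorphism. This is a sketch under stated assumptions: the field is taken to be all subsets of {1, 2, 3}, and the correspondence is the hypothetical map sending A to its intersection with {1, 2}; neither choice is from the paper.

```python
from itertools import combinations

def power_set(universe):
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

F = power_set({1, 2, 3})
KEEP = frozenset({1, 2})      # hypothetical: defines the correspondence A -> A'

def image(a):
    """Correspondent A' of A; sums and intersections are preserved."""
    return a & KEEP

# The class of F which corresponds to the null set 0' of F'.
null_class = [a for a in F if image(a) == frozenset()]

def rebuild(a, b):
    """Express b from a using X = a - b and Y = b - a, as in the proof."""
    x, y = a - b, b - a
    return x, y, (a - x) | y

x, y, b = rebuild(frozenset({1, 3}), frozenset({1}))
print(sorted(map(sorted, null_class)), sorted(x), sorted(y), sorted(b))
```

In this example the class corresponding to 0' consists of the null set and {3}, and any two sets with the same correspondent differ only by members of that class.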


COROLLARY. If $\mathfrak{N}$ consists of the null set only, F is isomorphic to F'.

THEOREM X. If the set system F' is the homomorph of the field F;
if F' contains a null set; and if the classes of F each contain a finite number
of sets, then any class of F is a residue class with respect to $\mathfrak{N}$,
where $\mathfrak{N}$ is the class of F which corresponds to 0'.


15 We refer to S later as the unit element of F.

16 This last statement is contained in theorem 42 of the paper of Stone's referred
to above.

17 Cf. van der Waerden, loc. cit., p. 56.




144 MORRIS KLINE. 


Proof. Suppose $K_{A'}$ is the class of sets which correspond to A' of F'.
Since $K_{A'}$ contains a finite number of sets, the intersection of all these sets,
$A_0$ say, is a set of $K_{A'}$ since $K_{A'}$ is a ring by theorem V. As in the
preceding theorem, if A belongs to $K_{A'}$, then $A - A_0$ belongs to
$\mathfrak{N}$. Then the class $K_{A'}$ is the residue class $A_0 + \mathfrak{N}$,
for $A = A_0 + (A - A_0)$ and this last is a set of $A_0 + \mathfrak{N}$.


Suppose the points of space T' are the transforms under the single-valued
function, f, of the points of space T. If A is a set in T, let us understand
the corresponding set in T' to mean the set of points of T' which correspond
to the points of A. We then have the


THEOREM XI. Let the points of space T' be single-valued transforms of
the points of space T. Let F be a field, or complete field, of sets in T such
that, if A and B are any sets of F for which $A \cdot B = 0$ then for the
corresponding sets A' and B' in T', $A' \cdot B' = 0$. The corresponding sets
in T' form a field, or complete field, isomorphic to F.


Proof. Under any single-valued transformation we have that if A + B = C,
where A and B are here sets of F, then A' + B' = C'.

Suppose $A \cdot B = D$. Again we have at once that $D' \subset A' \cdot B'$ and we
show first that $D' \supset A' \cdot B'$. Suppose p' belongs to A' and to B'. Then some
correspondent of p', p say, is in A, and some correspondent, q say, is in B.
If p = q, p' is in D'. If p ≠ q but both p and q belong to A and B, then p'
is in D'. If p ≠ q, and q does not belong to A say, then we shall have
$A \cdot (B - A) = 0$, but $A' \cdot (B - A)' \neq 0$, contrary to hypothesis. Hence
$D' = A' \cdot B'$.

We have, so far, that the system F' of sets in T' is the homomorph of F.
The corollary to theorem VII gives us the fact that F' is a field (complete if
F is complete). We note further that the only set of F which can correspond
to the null set of F' is the null set of F. Hence, by the corollary to theorem IX,
F' is the isomorph of F.


3. The correspondence of sums and limits of sequences of sets.

It is possible to have two fields isomorphic to each other and for the first
field to contain the sum of a countable number of its sets without the second
field containing the sum of the corresponding sets. The following is an
example of such a situation. Let the field F consist of a system of concentric
circles of radii 1/2, 3/4, 7/8, ..., a circle of radius 2, and the difference of
any two of these sets. Let the field F' consist of another system of concentric
circles of radii 1/2, 3/4, 7/8, ..., the sum of these circles, and the difference
of any two of these sets. If we let circles of equal radii correspond and let
the circle of radius 2 in the first system correspond to the sum of the sets in
the second, the two systems will be isomorphic but the first does not contain
the sum of the countable number of circles of radii less than one. Needless
to add, the limit sets ¹⁸ of topologically convergent sequences of sets of the
systems may not belong to either system, much less correspond even when the
sets of the sequences do.

Hence we may consider the following questions. Given two isomorphic 
fields, under what conditions does the sum or intersection of a countable 
sequence of sets belong to the second field if the sum or intersection of the 
corresponding sets belongs to the first? Under what conditions do these sums 
or intersections correspond when they belong? Under what conditions do the 
limit sets of convergent sequences of corresponding sets belong to the fields 
and correspond? These questions are considered in this part; part of the
first question and others are more conveniently reserved for the next one. 

Essentially the problems which follow are concerned with conditions under 
which the isomorphism between two fields can be extended to new elements of 
the fields. The theorems hold in separable, metric spaces unless otherwise
indicated.


DEFINITIONS. A σ-ring is a ring to which the sum of each (countable)
sequence of sets of the ring belongs. Similarly for σ-fields.¹⁹ By a complete
σ-field we shall mean a σ-field with unit element.


THEOREM XII. Let the complete σ-field F' be the isomorph of the complete
σ-field F. Let $\{A_n\}$ be any sequence of sets of F and $\{A'_n\}$ the sequence
of corresponding sets of F'. Then $\sum_{n=1}^{\infty} A_n$ corresponds to
$\sum_{n=1}^{\infty} A'_n$; $\prod_{n=1}^{\infty} A_n$ and $\prod_{n=1}^{\infty} A'_n$
belong to F and F' respectively and correspond.

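Proof. Let $A = \sum_{n=1}^{\infty} A_n$ of F correspond to A' of F'. Also, let
$\sum_{n=1}^{\infty} A'_n$ of F' correspond to B of F. Since $A \cdot A_n = A_n$,
$A' \cdot A'_n = A'_n$ and $A' \supset \sum_{n=1}^{\infty} A'_n$. Likewise,
$B \supset \sum_{n=1}^{\infty} A_n$. However, this last implies that
$\sum_{n=1}^{\infty} A'_n \supset A'$. Hence $A' = \sum_{n=1}^{\infty} A'_n$ and
$B = \sum_{n=1}^{\infty} A_n$. This proves the first part of the theorem.

If $A_n$ of F corresponds to $A'_n$ of F', then $C(A_n)$ corresponds to $C(A'_n)$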


18 The term limit set will be used to mean the limit set of a sequence of point sets
converging in the topological sense. See Kuratowski, loc. cit., p. 155.
19 F. Hausdorff, Grundzüge der Mengenlehre, p. 23.


by the corollary to theorem VII. Then $\sum_{n=1}^{\infty} C(A_n)$ and
$\sum_{n=1}^{\infty} C(A'_n)$ belong to F and F' respectively and they correspond
to each other by the first part of this theorem. Since
$\prod_{n=1}^{\infty} A_n = C\bigl(\sum_{n=1}^{\infty} C(A_n)\bigr)$ and similarly for
$\prod_{n=1}^{\infty} A'_n$, the rest of the theorem follows.
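
The identity used in the last step, that a product of sets is the complement of the sum of their complements, can be checked on finite sets. This is only an illustrative sketch: the unit element `S` and the three sample sets are hypothetical and stand in for a countable sequence.

```python
from functools import reduce

S = frozenset(range(10))      # hypothetical unit element of the field

def complement(a):
    """C(A): complement with respect to the unit element S."""
    return S - a

sets = [frozenset({0, 1, 2, 3}), frozenset({2, 3, 4, 5}), frozenset({3, 5, 7})]

product = reduce(frozenset.intersection, sets)
via_sums = complement(reduce(frozenset.union, [complement(a) for a in sets]))

print(sorted(product), sorted(via_sums))
```

Both expressions give the same set, which is why the statement about products follows from the statement about sums once complements are known to correspond.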


THEOREM XIII. Let the complete σ-field F' be the isomorph of the
complete σ-field F so that closed sets of F correspond to closed sets of F' and
conversely. If the limit sets of monotonically increasing sequences of sets of
F and F' belong to F and F' respectively, then the limit sets of all convergent
sequences of sets of F and F' belong to F and F' respectively and correspond
when the sets of the sequences do.


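Proof. We first prove that the limit sets of monotonically increasing
sequences of corresponding sets correspond. Let $A_1 \subset A_2 \subset A_3 \subset \cdots$
be any monotonically increasing sequence of sets of F. The limit of this
sequence is $\overline{\sum_{n=1}^{\infty} A_n}$.²⁰ The sequence of corresponding sets
$\{A'_n\}$ is such that $A'_1 \subset A'_2 \subset A'_3 \subset \cdots$ and has for its
limit $\overline{\sum_{n=1}^{\infty} A'_n}$. Suppose $\overline{\sum_{n=1}^{\infty} A_n}$
corresponds to A' of F' and $\overline{\sum_{n=1}^{\infty} A'_n}$ corresponds to B of F.
Since $\overline{\sum_{n=1}^{\infty} A_n} \supset A_i$, $A' \supset A'_i$ and, since by
hypothesis A' is closed, $A' \supset \overline{\sum_{n=1}^{\infty} A'_n}$. Likewise
$B \supset \overline{\sum_{n=1}^{\infty} A_n}$. Then, by the isomorphism,
$\overline{\sum_{n=1}^{\infty} A'_n} \supset A'$. Hence
$A' = \overline{\sum_{n=1}^{\infty} A'_n}$ and $B = \overline{\sum_{n=1}^{\infty} A_n}$.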


We now note that if A is any set of F and A' the corresponding set of F',
then $\bar{A}$, the closure of A, belongs to F, and corresponds to $\bar{A}'$, which
belongs to F'. For, if we form the sequence for which $A_n = A$, then
$\lim_{n=\infty} A_n = \bar{A}$. Likewise for A' and the first part of the proof gives
the fact that $\bar{A}$ corresponds to $\bar{A}'$.
Now let $\{A_n\}$ be a sequence of monotonically decreasing sets of F and let
$\{A'_n\}$ be the sequence of corresponding sets of F'. Then $\lim_{n=\infty} A_n$
belongs to F; $\lim_{n=\infty} A'_n$ belongs to F'; and the first limit corresponds
to the second. For, $\lim_{n=\infty} A_n = \prod_{n=1}^{\infty} \bar{A}_n$ ²¹ and
$\lim_{n=\infty} A'_n = \prod_{n=1}^{\infty} \bar{A}'_n$. By the preceding paragraph,
$\bar{A}_n$


20 Kuratowski, loc. cit., p. 155.




and $\bar{A}'_n$ belong to F and F' respectively and correspond. The preceding
theorem gives the conclusions asserted in this paragraph.

To complete the proof we utilize the following elementary lemma which 
is stated without proof. 


Lemma. If a sequence of sets has a limit, that limit is the limit of a 
monotonically decreasing sequence of sets formed from the sets of the first 
sequence by summing. 
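
The construction in the lemma can be sketched for finite sets. Two assumptions are made here that are not in the paper: the space is treated as discrete, so that closures may be ignored and the limit of a decreasing sequence is simply its product, and the sequence is represented by a finite list that is taken to be constant from its last entry onward. The helper name `tail_sums` is hypothetical.

```python
def tail_sums(seq):
    """Return B_n = A_n + A_{n+1} + ... for a finite list standing in for a
    sequence that stays constant from its last entry onward."""
    sums = []
    running = frozenset()
    for a in reversed(seq):
        running = running | a
        sums.append(running)
    return list(reversed(sums))

# Hypothetical convergent sequence; its limit is the eventual value {2, 3}.
seq = [frozenset({1, 2}), frozenset({2, 5}), frozenset({2, 3}),
       frozenset({2, 3}), frozenset({2, 3})]

B = tail_sums(seq)
limit_of_B = frozenset.intersection(*B)
print([sorted(b) for b in B], sorted(limit_of_B))
```

The sets B_n decrease monotonically, and in this discrete example their product agrees with the limit of the original sequence.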


To conclude the proof of the theorem we have but to note that if $\{A_n\}$
is any sequence of sets of F and $\{A'_n\}$ the sequence of corresponding sets of
F', then the set $B_n = \sum_{k=n}^{\infty} A_k$ corresponds to
$B'_n = \sum_{k=n}^{\infty} A'_k$ by theorem XII. By the lemma,
$\lim_{n=\infty} A_n = \lim_{n=\infty} B_n$ and likewise for $A'_n$ and $B'_n$. But
$\lim_{n=\infty} B_n$ belongs to F and $\lim_{n=\infty} B'_n$ belongs to F' and the two
limits correspond by the results of one of the paragraphs above.

The preceding theorem uses an hypothesis on the correspondence of closed
sets which is, in a sense, a generalization of homeomorphism for two spaces.
This is apparent if we specialize F and F' to the case where each consists of
all the subsets of a given space. The question for which isomorphic fields
this condition on the sets implies a homeomorphism between the unit elements
of the fields remains open, as does the question of what additional conditions
are necessary to imply this conclusion for any two isomorphic, complete σ-fields.


4. The correspondence of open and closed sets. This section will
consider primarily conditions under which open and closed sets in one field
correspond to open and closed sets in a homomorphic or isomorphic field.


THEOREM XIV. Let F' be a complete field which is the homomorph of
the complete field F. Suppose that limit sets of monotonically increasing (or
decreasing) sequences of corresponding sets of F and F' belong to F and F'
respectively and correspond. If A is a set of F open in the unit element S
of F, then A' is open in S' of F'.

Proof. Let $C_n = C(A)$.²² Then $\{C_n\}$ is a monotonically increasing
sequence of sets with $\lim_{n=\infty} C_n = \overline{C(A)}$. By the corollary to
theorem VII, $C(A)$ corresponds to $C(A')$ and we have, by the hypothesis on limit
sets, that $\overline{C(A)}$ corresponds to $\overline{C(A')}$ because the latter is the
limit of the sequence for which $C'_n = C(A')$. Since A is open in S, $C(A)$ is
closed in S. Hence


21 Ibid.
22 It is to be remembered in this theorem that complements are with respect to the
unit elements of the complete fields.




$\overline{C(A)} = C(A)$. Then $\overline{C(A')} = C(A')$, and $C(A')$ is closed in S'
or A' is open in S'.
THEOREM XV. Let F' be a field which is the homomorph of the field F.
Suppose the limit sets of monotonically decreasing sequences of corresponding
sets of F and F' belong to F and F' respectively, and correspond. Then if
$\{A_n\}$ is a sequence of open sets of F such that $\sum_{n=1}^{\infty} A_n$ belongs
to F, then $\sum_{n=1}^{\infty} A'_n$ belongs to F', and $\sum_{n=1}^{\infty} A_n$
corresponds to $\sum_{n=1}^{\infty} A'_n$.


Proof. Let $\sum_{n=1}^{\infty} A_n = A$. Suppose A corresponds to A'. Since
$A \cdot A_n = A_n$, $A' \cdot A'_n = A'_n$ and
$A' \supset \sum_{n=1}^{\infty} A'_n = B'$. We shall show that $A' - B' = 0$.

Let $C'_n = A' - \sum_{i=1}^{n} A'_i$; then $\{C'_n\}$ is a monotonically decreasing
sequence of sets of F' whose limit exists and is
$\prod_{n=1}^{\infty} \overline{\bigl(A' - \sum_{i=1}^{n} A'_i\bigr)}$. Call this limit C'.
Since $A' - B' = \prod_{n=1}^{\infty} \bigl(A' - \sum_{i=1}^{n} A'_i\bigr)$,
$C' \supset A' - B'$. A sequence in F corresponding to $\{C'_n\}$ is
$\{C_n\} = \{A - \sum_{i=1}^{n} A_i\}$, and this sequence has as limit
$C = \prod_{n=1}^{\infty} \overline{\bigl(A - \sum_{i=1}^{n} A_i\bigr)}$. In view of the
hypothesis on limit sets C corresponds to C'. But the $A_n$'s are open sets, as are
the sets $\sum_{i=1}^{n} A_i$ for each n. Then $\overline{A - \sum_{i=1}^{n} A_i}$
contains no points of $\sum_{i=1}^{n} A_i$ for such points are interior points of
$\sum_{i=1}^{n} A_i$ and hence cannot be limit points of the complement in A. Since
$A = \sum_{n=1}^{\infty} A_n$, the value of C shows that $C \cdot A = 0$. In view of the
homomorphism, $C' \cdot A' = 0'$. Since $C' \supset A' - B'$ it must be that
$A' - B' = 0$, and the theorem follows.


COROLLARY. Under the hypothesis of the theorem and the added condition
that F be complete we have that $\sum_{n=1}^{\infty} A'_n$ is open in S', the unit
element of F'.
Proof. By the corollary to theorem VII, F' is complete. Moreover,
since $\sum_{n=1}^{\infty} A_n$ is open in the space, it is open in S, the unit element
of F. Then, by theorem XIV, $\sum_{n=1}^{\infty} A'_n$ is open in S'.




THEOREM XVI. Let the complete field F' be the homomorph of the
complete field F. Suppose the limit sets of monotonically decreasing sequences
of corresponding sets of F and F' belong to F and F' respectively and
correspond. If $\{A_n\}$ is a sequence of sets of F closed in S, the unit element of F,
and such that $\prod_{n=1}^{\infty} A_n$ belongs to F, then $\prod_{n=1}^{\infty} A'_n$
belongs to F' and is the correspondent of $\prod_{n=1}^{\infty} A_n$. Moreover,
$\prod_{n=1}^{\infty} A'_n$ is closed in S'.

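Proof. Since $A_n$ is closed in S, $C(A_n)$ is open in S. Then, since
$\prod_{n=1}^{\infty} A_n$ belongs to F, and since
$C\bigl(\prod_{n=1}^{\infty} A_n\bigr) = \sum_{n=1}^{\infty} C(A_n)$,
$\sum_{n=1}^{\infty} C(A_n)$ belongs to F. By theorem XV, $\sum_{n=1}^{\infty} C(A'_n)$
belongs to F' and is the correspondent of $\sum_{n=1}^{\infty} C(A_n)$.
But $\prod_{n=1}^{\infty} A'_n = C\bigl(\sum_{n=1}^{\infty} C(A'_n)\bigr)$ and complements
correspond under homomorphism.
By the corollary to theorem XV, $\sum_{n=1}^{\infty} C(A'_n)$ is open in S', and hence
$\prod_{n=1}^{\infty} A'_n$ is closed in S'.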
Several of the theorems in part III depended in part upon the hypothesis
that closed sets in a field F correspond under isomorphism to closed sets in
a field F'. The following theorem gives sufficient conditions for this to be
the case.


THEOREM XVII. Let F' be a field which is the isomorph of the field F.
Let corresponding sets of F and F' be of the same dimension,²³ and, moreover,
be homogeneously dimensional.²⁴ Let A of F be a set for which dim B(A) ²⁵
< dim A, and let the same condition hold for the corresponding set A' of F'.
If $\bar{A}$ and $\bar{A}'$ belong to F and F' respectively, then $\bar{A}$ corresponds
to $\bar{A}'$.


Proof. Let $\bar{A}$ of F correspond to B' of F' and let $\bar{A}'$ of F' correspond
to C of F. Since $\bar{A}' \supset A'$, $C \supset A$. Likewise $B' \supset A'$. Let
$C - \bar{A} = E_1$. Then $\bar{A}' - B' = E'_1$, where $E'_1$ is the correspondent of
$E_1$. Suppose the dimension of A is k. Since B(A) is a closed set, and hence both
an $F_\sigma$ and


23 In the sense of Menger. See his Dimensionstheorie, chap. II.

24 A set is homogeneously dimensional if it is of the same dimension in each of its
points.

25 Menger defines the boundary of a set only for open sets. See his Dimensionstheorie,
p. 34. We use the more general definition of Kuratowski, loc. cit., p. 24. There
the boundary of a set A is defined to be $\bar{A} \cdot \overline{C(A)}$.




a $G_\delta$,²⁶ and since $\bar{A} = A + B(A)$, $\bar{A}$ is k-dimensional.²⁷ Then, by
hypothesis, B' is k-dimensional. Since A is k-dimensional, again by hypothesis, A' is.
Then, by reasoning similar to that just used, $\bar{A}'$ and C are k-dimensional.

Since $B' \supset A'$, $E'_1$, which is $\bar{A}' - B'$, is a subset of the boundary of A',
and, since dim B(A') < dim A', dim $E'_1$ < k.²⁸ Hence, $E_1$ is of dimension
< k. Since $C = C \cdot \bar{A} + E_1$, and since $\bar{A} \cdot E_1 = 0$, C must be at most
(k - 1)-dimensional in the points of $E_1$ because such points can be enclosed
in neighborhoods whose boundaries do not intersect $\bar{A}$ and hence intersect C
in points of $E_1$ only. However, since C is homogeneously dimensional, $E_1 = 0$.
In view of the isomorphism, $E'_1 = 0$. Then $B' \supset \bar{A}'$ and $\bar{A} \supset C$.
Now let $\bar{A} - C = E_2$. Then $B' - \bar{A}' = E'_2$ where $E'_2$ corresponds to $E_2$.
In view of the symmetry due to the isomorphism we may use the process just employed
to show that $E'_2 = 0$. Then we have that $B' \subset \bar{A}'$, and therefore that
$B' = \bar{A}'$. Hence $\bar{A}$ corresponds to $\bar{A}'$.

The above theorem has application as the following two corollaries show. 


COROLLARY I. Let the hypothesis of the above theorem relating to F
and F' hold for F and F' in the Cartesian n-dimensional space, $R_n$. Let A
and A' be corresponding, open sets of F and F' respectively. If $\bar{A}$ belongs to
F and $\bar{A}'$ to F', then $\bar{A}$ corresponds to $\bar{A}'$.


Proof. We have but to apply the theorem ²⁹ that in $R_n$ the boundaries
of A and A' are of dimension = n - 1.


COROLLARY II. Let the hypothesis of the above theorem relating to F
and F' hold in $R_n$, where F and F' are now σ-fields. Let $\{A_n\}$ and $\{A'_n\}$ be
sequences of corresponding, open sets of F and F' respectively. If F and F'
each contain the closures of their sets, then $\overline{\sum_{n=1}^{\infty} A_n}$ and
$\overline{\sum_{n=1}^{\infty} A'_n}$ belong to F and F' respectively and correspond.


Proof. That $\sum_{n=1}^{\infty} A_n$ and $\sum_{n=1}^{\infty} A'_n$ belong to F and F'
respectively follows from the definition of σ-fields. They correspond to each other
according to theorem XII. Then $\overline{\sum_{n=1}^{\infty} A_n}$ and
$\overline{\sum_{n=1}^{\infty} A'_n}$ belong to F and F' respectively by
hypothesis and they correspond to each other by the first corollary.


NEW YORK UNIVERSITY.


26 Hausdorff, Grundzüge der Mengenlehre, p. 306.
27 Menger, loc. cit., p. 114. 28 Ibid., p. 81. 29 Menger, loc. cit., p. 268.




POLYNOMIAL IDEALS DEFINED BY INFINITELY NEAR 
BASE POINTS.* 


By Oscar ZARISKI. 


Introduction. The linear systems of curves, in the plane or on an algebraic 
surface, which are theoretically of importance, are the complete systems. For 
complete linear systems the defining linear conditions are base conditions, by 
which the curves of the system are constrained to pass with assigned multi- 
plicities through an assigned set of base points. The set of base points may 
consist in part of proper points and in part of points infinitely near and in 
the successive neighborhoods of the proper points (7°, p. 27). However vague 
this geometric terminology may sound, it is nevertheless true that the facts 
involved have a precise algebraic meaning, and satisfactory definitions are 
available in terms of analytical branches and of intersection multiplicities of 
such branches. But it is equally true that although a well rounded geometric 
theory can and has been developed along these lines (*, pp. 327-399), the 
arithmetic content of the notion of infinitely near points still remains somewhat 
obscure. It is the main purpose of the present investigation to develop an 
arithmetic theory parallel to the geometric theory of infinitely near points (in
the plane or on a surface without singularities). By this we mean primarily
a systematic study of those polynomial ideals in k[x, y], or formal power
series ideals (in two indeterminates), which adequately describe linear
conditions having the character of base conditions. We call these ideals complete
ideals (II, 12) by analogy with the terminology used in the theory of linear
systems. We always suppose that the underlying field k is algebraically closed
and of characteristic zero, but the theory could be extended with a few modi- 
fications to fields of any characteristic. At any rate, the hypothesis that the 
characteristic is zero is not used in the first four sections of Part I. For 
possible generalizations to spaces of higher dimension than 2 it would be im- 
portant to consider also fields which are not algebraically closed. 

The class of complete ideals enjoys several striking properties, which 
respond, however, to a high geometric expectation. This class is closed under 
all standard operations on ideals (except addition): the intersection, the product
and the quotient of two complete ideals is a complete ideal (II, 12). Moreover,
a complete ideal has a unique factorization into simple complete ideals (I, 7),
an ideal being simple if it is not the product of ideals different from the 
unit ideal. 


* Received November 8, 1937.




152 OSCAR ZARISKI. 


We define complete ideals in terms of valuation ideals. By valuation
ideals in the polynomial ring k[x, y] we mean the contracted ideals of the
ideals of any valuation ring (belonging to some valuation of the field k(x, y))
which contains x and y. Valuation ideals are complete ideals, and, in particular,
the simple complete ideals are valuation ideals. Most of our work is a study
of valuation ideals (briefly: v-ideals) whether from an axiomatic point of view
(Part I) or from the point of view of formal power series (Part II). The
study of the behaviour of valuation ideals under quadratic transformations
(I, 4 and 5) leads to reduction theorems (Theorems 4.4 and 5.3) which form
the basis of many inductive proofs. A simple v-ideal of kind k + 1 (I, 6)
represents the arithmetic analogue of the notion of a point infinitely near and in
the k-th neighborhood of a proper point.

The treatment deals explicitly only with polynomial rings and rings of 
holomorphic functions. It is clear, however, that the results carry over auto- 
matically to algebraic surfaces without singularities, since any set of base 
conditions at a simple point P of an algebraic surface is described by a complete 
ideal in the ring of holomorphic functions of the uniformizing parameters at P. 

We make one more remark. In many instances the proofs do not depend 
on the fact that we have only two indeterminates. On the other hand, in many 
points a generalization to any number of variables faces new difficulties. In 
spaces of higher dimension the base conditions may be of a more complicated 
type: besides base conditions at an isolated base point, it is possible to have 
infinitely near base curves, or infinitely near base surfaces, etc. At an isolated 
base point we may also have such base conditions as are given by infinitesimal 
base curves (the case of an assigned tangent plane at a base point is the simplest 
example). The algebro-geometric theory has as yet no firm grasp on these 
eventualities. A generalization of the present treatment to any number of
variables would therefore represent not merely an arithmetization of a known
chapter of classical algebraic geometry.


CONTENTS.

Part I.                                                                  PAGE
 1. Valuation ideals in rings of polynomials in two indeterminates.       153
 2. A characteristic property of a Jordan sequence of valuation ideals.   157
 3. Further properties of v-ideals.                                       160
 4. v-ideals and quadratic transformations.                               163
 5. Simple and composite v-ideals.                                        168
 6. Properties of simple v-ideals.                                        169
 7. A factorization theorem for v-ideals.




POLYNOMIAL IDEALS. 153 


Part II.                                                                 PAGE
 8. Algebraic and transcendental valuations.                              173
 9. Valuation ideals in the ring of holomorphic functions.
10. The notion of a general element of an ideal of formal power series.   179
11. The characterization of simple v-ideals.                              184
12. The class of complete ideals.                                         198
13. Simple v-ideals and divisors of the second kind.                      201

PART I. 


1. Valuation ideals in rings of polynomials in two indeterminates.
Let k be an algebraically closed field of characteristic zero and $\Sigma$ a pure
transcendental extension of k of dimension (degree of transcendentality) two. We
consider a valuation B of $\Sigma$, i.e. a homomorphic mapping of the multiplicative
group $\Sigma$ (the element 0 excluded) upon an ordered abelian group $\Gamma$:
$a \to v(a)$ = value of a, $a \neq 0$, $v(a) \in \Gamma$, satisfying the valuation axioms:


(1) $v(a \cdot b) = v(a) + v(b)$; (2) $v(a + b) \geq \min(v(a), v(b))$;
(3) $v(a^*) \neq 0$,


for some $a^*$ in $\Sigma$ (*, p. 101; *).¹ We assume, moreover, that the elements of
the underlying field k, other than 0, have value zero in the given valuation B.

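Let $\mathfrak{B}$ be the valuation ring of B (the set of all elements of $\Sigma$ whose
value is $\geq 0$). Any ideal $\mathfrak{a}$ in $\mathfrak{B}$ has the following self-evident
property: $a \equiv 0(\mathfrak{a})$, $v(b) \geq v(a)$ implies $b \equiv 0(\mathfrak{a})$.
Conversely, any subset of $\mathfrak{B}$ with this property constitutes, together with 0,
an ideal. Consequently, given any two ideals $\mathfrak{a}$, $\mathfrak{a}'$ in $\mathfrak{B}$,
either $\mathfrak{a} \equiv 0(\mathfrak{a}')$ or $\mathfrak{a}' \equiv 0(\mathfrak{a})$. We say
that $\mathfrak{a}$ precedes $\mathfrak{a}'$ (and that $\mathfrak{a}'$ follows $\mathfrak{a}$) if
$\mathfrak{a}' \equiv 0(\mathfrak{a})$, but $\mathfrak{a}' \neq \mathfrak{a}$. The ideals in
$\mathfrak{B}$ form then an ordered set. The unit ideal $\mathfrak{B}$ is the first element of
this set. The immediate successor of $\mathfrak{B}$ is the ideal $\mathfrak{P}$ consisting of
all elements whose value is positive.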

Let $\mathfrak{O}$ be a domain of integrity in $\Sigma$, contained in the valuation ring
$\mathfrak{B}$. An ideal $\mathfrak{A}$ in $\mathfrak{O}$ shall be called a valuation ideal, or
briefly, a v-ideal, belonging to or for the valuation B, if $\mathfrak{A}$ is the contracted
ideal of an ideal $\mathfrak{a}$ in $\mathfrak{B}$, i.e.


1 Note the following consequence of the above axioms: if $v(a) < v(b)$, then
$v(a + b) = v(a)$. Proof: We have $v(a) = v(a \cdot 1) = v(a) + v(1)$, whence $v(1) = 0$.
Also $v(-1) + v(-1) = v((-1)^2) = v(1) = 0$, consequently $v(-1) = 0$, since $\Gamma$ is an
ordered group. Hence $v(-b) = v(b)$, and, by axiom (2), $v(a - b) \geq \min(v(a), v(b))$.
Replacing in this relation a by a + b we find $v(a) \geq \min(v(a + b), v(b))$, and our
assertion follows.
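
The axioms and the consequence noted in the footnote can be tried numerically on a familiar valuation. The sketch below uses the 3-adic valuation of the rational numbers; this is an assumption made only for illustration, since the paper works with valuations of a function field of two variables, not of the rationals. The function name `v` and the sample values are hypothetical.

```python
from fractions import Fraction

def v(a, p=3):
    """p-adic value of a nonzero rational number (illustrative valuation)."""
    a = Fraction(a)
    if a == 0:
        raise ValueError("the element 0 is excluded")
    n, d, value = a.numerator, a.denominator, 0
    while n % p == 0:       # each factor p in the numerator raises the value
        n //= p
        value += 1
    while d % p == 0:       # each factor p in the denominator lowers it
        d //= p
        value -= 1
    return value

a, b = Fraction(3), Fraction(18)          # v(a) = 1 < v(b) = 2
print(v(a * b) == v(a) + v(b))            # axiom (1)
print(v(a + b) >= min(v(a), v(b)))        # axiom (2)
print(v(a + b) == v(a))                   # consequence: v(a) < v(b) gives v(a+b) = v(a)
```

All three comparisons hold for this pair; the last line is the statement of the footnote, that the smaller of two distinct values is the value of the sum.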




if $\mathfrak{A} = [\mathfrak{O}, \mathfrak{a}]$. If the reference to the specific valuation B
is omitted and if we speak of $\mathfrak{A}$ as a v-ideal, we shall mean then that
$\mathfrak{A}$ is a valuation ideal for some valuation B of $\Sigma$ such that the valuation
ring of B contains $\mathfrak{O}$. We have thus defined in $\mathfrak{O}$ a special class of
ideals, the class of valuation ideals. We agree to include the zero ideal in this class.
Also the valuation ideals in $\mathfrak{O}$, belonging to a given valuation B, enjoy the
property: $a \equiv 0(\mathfrak{A})$, $v(b) \geq v(a)$ implies $b \equiv 0(\mathfrak{A})$, where
$\mathfrak{A}$ is a v-ideal and a, b are elements of $\mathfrak{O}$. Conversely, any subset of
$\mathfrak{O}$ with this property constitutes, together with the zero element of
$\mathfrak{O}$, a valuation ideal belonging to B. The valuation ideals in $\mathfrak{O}$,
belonging to a given valuation B, form an ordered set: $\mathfrak{A}$ precedes
$\mathfrak{A}'$ if $\mathfrak{A}' \equiv 0(\mathfrak{A})$ and $\mathfrak{A}' \neq \mathfrak{A}$.
Since a v-ideal $\mathfrak{A}$ in $\mathfrak{O}$ for the valuation B is the contracted ideal
of an ideal $\mathfrak{a}$ in $\mathfrak{B}$, $\mathfrak{A}$ is also the contracted ideal of
its extended ideal $\mathfrak{B}\mathfrak{A}$ in $\mathfrak{B}$. There is thus a (1,1)
correspondence between the v-ideals in $\mathfrak{O}$ belonging to the valuation B and
their extended ideals in the valuation ring $\mathfrak{B}$. These extended ideals form in
general a proper subset of the set of all ideals of $\mathfrak{B}$.

Let, in particular, $\mathfrak{O}$ be the ring of polynomials in x, y, where we assume
that x and y are generating elements of $\Sigma$ ($\Sigma = k(x, y)$) and elements of the
valuation ring.² From the fact that every ideal in $\mathfrak{O}$ possesses a finite base
and from the valuation axiom (2), it follows that any ideal $\mathfrak{M}$ in $\mathfrak{O}$
contains elements of smallest possible value in B. If $\alpha$ is this minimum,
$\alpha = \min\{v(a)\}$, $a \in \mathfrak{M}$, we shall write $\alpha = v(\mathfrak{M})$ and we
shall regard $\alpha$ as the evaluation of the ideal $\mathfrak{M}$. Two ideals
$\mathfrak{M}$, $\mathfrak{M}'$ will be said to be equivalent, in symbols
$\mathfrak{M} \sim \mathfrak{M}'$, if $v(\mathfrak{M}) = v(\mathfrak{M}')$. The class
$\{\mathfrak{M}\}$ of all ideals equivalent to $\mathfrak{M}$ contains one and only one
v-ideal for B, namely the ideal $\mathfrak{A}$ consisting of all elements whose value is
$\geq v(\mathfrak{M})$. Clearly $\mathfrak{A}$ is a divisor of any ideal of the class. In the
ordered set of v-ideals of $\mathfrak{O}$, belonging to the valuation B, $\mathfrak{A}$
precedes $\mathfrak{A}'$ if $v(\mathfrak{A}) < v(\mathfrak{A}')$.
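
The evaluation of an ideal and the v-ideal of its class can be illustrated with a weighted-order valuation on polynomials in x, y. Everything specific below is an assumption for illustration and not taken from the paper: the weights v(x) = 2, v(y) = 3, the representation of a polynomial as a dictionary of exponent pairs, and the two sample generators. Since every element of the ideal is a combination of the generators with polynomial coefficients of value at least 0, axioms (1) and (2) give that the least value in the ideal is the least value among the generators.

```python
W = (2, 3)   # hypothetical weights: v(x) = 2, v(y) = 3

def v(poly):
    """Value of a nonzero polynomial given as {(i, j): coefficient}:
    the least weight among its monomials x^i y^j."""
    return min(W[0] * i + W[1] * j for (i, j), c in poly.items() if c != 0)

def v_ideal(generators):
    """Evaluation of the ideal with the given finite base."""
    return min(v(g) for g in generators)

def in_v_ideal(poly, alpha):
    """Membership in the v-ideal of all elements of value >= alpha."""
    return v(poly) >= alpha

g1 = {(2, 1): 1, (0, 3): 1}    # x^2 y + y^3
g2 = {(4, 0): 1}               # x^4
alpha = v_ideal([g1, g2])

print(alpha, in_v_ideal({(2, 1): 1}, alpha), in_v_ideal({(3, 0): 1}, alpha))
```

In this example the monomial x^2 y has the value of the ideal and so lies in the v-ideal of the class, whereas x^3 has smaller value and does not.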

In the sequel we shall be dealing with a fixed valuation B and it will be
understood that when we speak of a v-ideal in $\mathfrak{O}$ we mean a v-ideal "belonging
to the given valuation B."

The valuation ring $\mathfrak{B}$ contains the divisorless ideal $\mathfrak{P}$, consisting of
all elements whose value is > 0. We shall denote by $\mathfrak{p}$ the contracted ideal of
$\mathfrak{P}$ in $\mathfrak{O}$; $\mathfrak{p}$ is obviously a prime ideal in $\mathfrak{O}$.
By the dimension r of the valuation B is meant the dimension of the ideal $\mathfrak{P}$,
i.e. the degree of transcendentality of the field of residual classes
$\mathfrak{B}/\mathfrak{P}$ over k.³ Evidently, r is either 0 or 1.


2 Note that if x is any element in $\Sigma$, then either x or 1/x belongs to $\mathfrak{B}$,
since $v(1/x) = -v(x)$.

3 By hypothesis, 0 is the only element of k which belongs to $\mathfrak{P}$. Hence
$\mathfrak{B}/\mathfrak{P}$ contains a subfield isomorphic to k. We identify this subfield
with k.




THEOREM 1. In the ordered set of v-ideals in $\mathfrak{O}$ every v-ideal $\mathfrak{A}$
has an immediate successor $\mathfrak{A}'$. If the valuation B is of dimension zero, then
$\mathfrak{A}'$ is a maximal subideal of $\mathfrak{A}$, and the ring
$\mathfrak{A}/\mathfrak{A}'$ is isomorphic to the underlying field k.


Proof. By the valuation axiom (3), there exists an element $a^*$ such that
$v(a^*) > 0$. Let $a^* = f(x, y)/g(x, y)$, $f \in \mathfrak{O}$, $g \in \mathfrak{O}$. Since
$v(f) = v(a^*) + v(g)$ and $v(g) \geq 0$, also $v(f) > 0$. If then $\alpha = v(\mathfrak{A})$,
there exist in $\mathfrak{A}$ elements whose value is greater than $\alpha$, for instance,
the elements of $\mathfrak{A}f$. The totality of all polynomials whose value is greater than
$\alpha$ constitutes, together with the element 0, a v-ideal $\mathfrak{A}'$, contained in
$\mathfrak{A}$, and clearly there exist no v-ideals which follow $\mathfrak{A}$ (proper
multiples of $\mathfrak{A}$) and precede $\mathfrak{A}'$ (proper divisors of $\mathfrak{A}'$).

Assume that B is of dimension 0, whence $\mathfrak{B}/\mathfrak{P}$ is an algebraic
extension of the underlying field k. Since k is algebraically closed, it follows that
$\mathfrak{B}/\mathfrak{P}$ is isomorphic to k. Let f be an element of $\mathfrak{A}$ of
smallest possible value, $v(f) = v(\mathfrak{A})$, and let $\phi$ be any other element in
$\mathfrak{A}$. Since $v(\phi) \geq v(f)$, we have $v(\phi/f) \geq 0$, i.e. $\phi/f$ belongs to
$\mathfrak{B}$. Hence $\phi/f \equiv c(\mathfrak{P})$, where c is in k,
$v(\phi/f - c) = v[(\phi - cf)/f] > 0$, i.e. $v(\phi - cf) > v(f)$, and consequently
$\phi - cf \equiv 0(\mathfrak{A}')$. This shows that $\mathfrak{A}/\mathfrak{A}' \cong k$ and
also that $\mathfrak{A}'$ is a maximal subideal of $\mathfrak{A}$, q.e.d.

If B is of dimension 1, it defines an homomorphism of the field $\Sigma$ upon
the field $\mathfrak{B}/\mathfrak{P}$ of algebraic functions of one variable, i.e. B is a
"divisor" of $\Sigma$. If $\mathfrak{O}$ contains elements which mod $\mathfrak{p}$ are
transcendental with respect to k, i.e. if $\mathfrak{p}$ is a 1-dimensional ideal in
$\mathfrak{O}$, we are dealing with a divisor of the first kind with respect to
$\mathfrak{O}$. The ideal $\mathfrak{p}$ is then a principal ideal, say $\mathfrak{p} = (f)$,
where f is an irreducible polynomial, and the v-ideals in $\mathfrak{O}$ are the ideals
$\mathfrak{p}^n = (f^n)$, n = 0, 1, 2, .... If, however, $\mathfrak{p}$ is 0-dimensional, we
are dealing with a divisor of the second kind with respect to $\mathfrak{O}$. The v-ideals
in $\mathfrak{O}$ are in this case certain primary ideals belonging to $\mathfrak{p}$. We need
not consider separately this case, because it reduces to the case of 0-dimensional
valuations. In fact, we may consider an arbitrary valuation $B_1$ of the field
$\Sigma_1 = \mathfrak{B}/\mathfrak{P}$ (a point of the Riemann surface of the field
$\Sigma_1$). The given valuation B of $\Sigma$ followed up by the valuation $B_1$ of
$\Sigma_1$ defines an homomorphism of $\Sigma$ upon the underlying field k (together with
symbol $\infty$), hence a 0-dimensional valuation B' of $\Sigma$. The v-ideals in
$\mathfrak{O}$ belonging to B will be among the v-ideals belonging to B'.

From now on we shall only consider 0-dimensional valuations. If B is 
0-dimensional, then the v-ideal p = [8,9] is prime and 0-dimensional, since 


nd 
eal 
ls, 
a) 
if 
he 
its 
he 
he 
of 
me 
he 
ase 
ns 
)}, 
of 
ols 
M 
all 
of 
B, 
be 
ng 
all 
om 
ty 
ce 


156 OSCAR ZARISKI. 


it is the immediate successor of the unit ideal 𝔒 and since, by Theorem 1,
𝔒/p ≅ f. Replacing, if necessary, x and y by x − c, y − d, where x ≡ c (p),
y ≡ d (p), we may assume that x ≡ 0 (p), y ≡ 0 (p), whence p = (x, y).
Starting with 𝔒 = q_0, and with its successor p = q_1, we form the simple
sequence of v-ideals

(1)          q_0, q_1, q_2, ..., q_i, ...,

where q_{i+1} is the immediate successor of q_i. Each q_{i+1} is a maximal
subideal of its predecessor q_i. We shall call a Jordan sequence any sequence of
ideals having this last mentioned property. It is clear that all the ideals q_i,
i ≥ 1, in the Jordan sequence (1), are primary ideals belonging to p (= q_1). In
fact: (1) q_i ≡ 0 (p); (2) ab ≡ 0 (q_i), a ≢ 0 (q_i) imply that
v(a) + v(b) > v(a), whence v(b) > 0 and b ≡ 0 (p); (3) v(p) < v(p^2) < ... < v(p^i),
whence v(p^i) ≥ v(q_i) and consequently p^i ≡ 0 (q_i).
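As an illustration of a Jordan sequence of v-ideals, the following sketch lists the first few ideals q_0, q_1, q_2, ... for one particular valuation. The valuation is our own choice and does not come from the text: the monomial valuation with v(x) = 1, v(y) = √2, for which the value of a polynomial is the least value of its monomials, so that every v-ideal is generated by monomials. The function names and the search bound are hypothetical, and values are compared in floating point, which is adequate only for the small exponents used here.

```python
import math

SQRT2 = math.sqrt(2.0)

def value(i, j):
    """Value of x^i y^j under the assumed valuation v(x) = 1, v(y) = sqrt(2)."""
    return i + j * SQRT2

def minimal_generators(threshold, bound=12):
    """Minimal monomial generators (as exponent pairs) of {f : v(f) >= threshold}."""
    eps = 1e-9  # tolerance for floating-point comparison
    inside = [(i, j) for i in range(bound) for j in range(bound)
              if value(i, j) >= threshold - eps]
    # A monomial is a minimal generator if no other monomial of the ideal divides it.
    return sorted(
        m for m in inside
        if not any(n != m and n[0] <= m[0] and n[1] <= m[1] for n in inside)
    )

def jordan_sequence(count, bound=12):
    """First `count` v-ideals q_0, q_1, ..., one for each attained value in increasing order."""
    values = sorted(value(i, j) for i in range(bound) for j in range(bound))
    return [minimal_generators(t, bound) for t in values[:count]]

for n, gens in enumerate(jordan_sequence(6)):
    print(n, gens)
```

Under these assumptions the sequence begins with q_0 = (1), q_1 = (x, y), q_2 = (y, x^2), in agreement with the normalization p = (x, y) used in the text, and each ideal is obtained from its predecessor by dropping exactly one monomial value.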

Two cases are possible: (a) either the intersection of all the ideals q_i is
the zero-ideal; (b) or this intersection is a certain ideal p_1 ≠ (0). In the
first case the Jordan sequence (1) contains all the v-ideals of 𝔒 belonging to
the given valuation B. We investigate now the second case.

We first prove that p_1 is a prime ideal. In fact, let ab ≡ 0 (p_1), a ≢ 0 (p_1),
and let q_h be the last ideal in the sequence {q_i} which contains a. Then
v(a) = v(q_h), whence v(a) ≤ ρ v(p), where ρ is the exponent of the ideal q_h.
No power p^n of p can belong to all the ideals q_i, since p/p^n is of finite rank
with respect to the field f. Hence, for any integer n > 0 there exists an
integer i such that v(q_i) > v(p^n). Since ab ≡ 0 (q_i) for any value of i, it
follows that v(a) + v(b) > n v(p), n arbitrary. In view of the inequality
v(a) ≤ ρ v(p), we deduce that also v(b) > n v(p), n an arbitrary integer. In
particular, if q_m is any ideal in the sequence {q_i} and if ρ_m is its
exponent, we will have v(q_m) ≤ ρ_m v(p) < v(b), whence b ≡ 0 (q_m), for any m.
It follows that b ≡ 0 (p_1), i.e. p_1 is a prime ideal.

The ideal p_1 is one-dimensional, say p_1 = (f), where f is an irreducible
polynomial. The inequality v(b) > n v(p), n arbitrary, holds for any element b
of p_1, and this shows that the value group of B is non-archimedean (B is a
"special" valuation, of rank 2. See *, p. 113). It is not difficult to see that
all the v-ideals in 𝔒 for B are of the form p_1^m q_n, m, n = 0, 1, 2, .... In
fact, let F_1 and F_2 be any two polynomials in 𝔒 and let F_1 = f^{m_1} G_1,
F_2 = f^{m_2} G_2, where G_1 and G_2 are not divisible by f. If m_1 > m_2, then
v(F_1) > v(F_2), since v(f) > v(G_2). If m_1 = m_2, then v(F_1) > v(F_2) or
v(F_1) = v(F_2) according as v(G_1) > v(G_2) or v(G_1) = v(G_2). Hence the set
of all polynomials whose value is not less than the value of a given
polynomial f^m G


POLYNOMIAL IDEALS. 157 


(G ≢ 0 (f)) coincides with the ideal p_1^m q_n, where v(q_n) = v(G). This form
of the v-ideals brings out clearly the well-known decomposition of the
valuation B into two valuations of rank 1 and the nature of the value group, as
consisting in this case of pairs of integers. B decomposes into two valuations,
B′ and B̄. B′ is the one-dimensional valuation defined by the prime
1-dimensional ideal p_1, and its valuation ring 𝔅′ is the set of rational
functions F(x, y)/G(x, y), G ≢ 0 (f). B′ maps Σ upon the field Σ̄ = f(x̄, ȳ) of
algebraic functions of one variable, where Σ̄ is the quotient field of the ring
𝔒/p_1. B̄ is a valuation of Σ̄, and the sequence {q_i} (with x, y replaced by
x̄, ȳ) is the sequence of the valuation ideals in the ring 𝔒̄ = f[x̄, ȳ] which
belong to B̄.*


2. A characteristic property of Jordan sequence of valuation ideals. 
Given a Jordan sequence of ideals in 𝔒: q_0, q_1, q_2, ..., where q_0 = 𝔒,
q_1 = p = (x, y), we ask under what conditions will there exist a valuation
of Σ for which the given sequence {q_i} is the sequence of (zero-dimensional)
v-ideals. In other words: under what conditions does the given sequence {q_i}
belong to a valuation of Σ?


THEOREM 2.1. A necessary and sufficient condition in order that a Jordan
sequence {q_i} of 0-dimensional ideals in 𝔒 belong to a valuation of the field Σ,
is that the quotient q_i : (a) belong to the sequence, for any i and for any
element a in 𝔒.
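The necessity of the condition of Theorem 2.1 can be checked numerically on a toy case. The sketch below assumes, purely for illustration, the monomial valuation v(x) = 1, v(y) = √2 (not taken from the text), whose v-ideals are monomial ideals, and it tests the quotient q : (a) only for monomials a, for which the quotient of a monomial ideal is obtained by subtracting exponents. All function names are hypothetical.

```python
import math

SQRT2 = math.sqrt(2.0)
BOUND = 12  # exponents are searched below this bound (enough for this demo)

def value(m):
    """Value of the monomial with exponent pair m under the assumed valuation."""
    return m[0] + m[1] * SQRT2

def minimalize(monomials):
    """Minimal generators of the monomial ideal spanned by `monomials`."""
    ms = set(monomials)
    return sorted(m for m in ms
                  if not any(n != m and n[0] <= m[0] and n[1] <= m[1] for n in ms))

def v_ideal(threshold):
    """Generators of the v-ideal {f : v(f) >= threshold}."""
    return minimalize((i, j) for i in range(BOUND) for j in range(BOUND)
                      if value((i, j)) >= threshold - 1e-9)

def colon(gens, a):
    """Generators of q : (a) for a monomial ideal q and a monomial a."""
    return minimalize((max(g[0] - a[0], 0), max(g[1] - a[1], 0)) for g in gens)

q = v_ideal(value((2, 1)))      # the v-ideal of elements of value >= 2 + sqrt(2)
quotient = colon(q, (1, 0))     # q : (x)
print(quotient, quotient == v_ideal(value((2, 1)) - value((1, 0))))
```

In this example q : (x) is again a v-ideal of the same valuation, namely the ideal of all elements of value at least v(q) − v(x), which is what the theorem requires of every quotient q_i : (a).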


THEOREM 2.2. A necessary and sufficient condition in order that a Jordan
sequence {q_i} of 0-dimensional ideals in 𝔒 belong to a valuation of the field Σ,
is that the congruences

          q_i q_{j+1} : q_j ≡ 0 (q_{i+1})

hold true for any pair q_i, q_j of ideals of the sequence.
The characterization given in Theorem 2.2 has the advantage of involving
only operations within the given sequence {q_i}. We prove both theorems
simultaneously.


(1) The conditions are necessary. As to Theorem 2.1, let v(a) =, 
q_i : (a) = q′, and let b, c be any two elements of 𝔒 such that b ≡ 0 (q′),
v(c) ≥ v(b). Since ba ≡ 0 (q_i), we have v(c) + v(a) ≥ v(b) + v(a) ≥ v(q_i),
whence ca ≡ 0 (q_i), and consequently c ≡ 0 (q′). Hence q′ enjoys the property:

* It is possible to have p_1 = (0) also in the case of a valuation of rank 2.
This happens when the component B′ (divisor of Σ) of the valuation B, of rank 2,
is a divisor of the second kind with respect to 𝔒. (In geometric terms: the
divisor B′ is an exceptional curve which has been transformed into a point of
the plane (x, y)).




b ≡ 0 (q′), v(c) ≥ v(b), implies c ≡ 0 (q′), and is therefore a v-ideal belonging
to the given valuation B. Since q_i ≡ 0 (q′), necessarily q′ = q_j, for some j ≤ i.


The necessity of the condition of Theorem 2.2 is proved in a similar
manner. Let b be an element of q_i q_{j+1} : q_j, whence b q_j ≡ 0 (q_i q_{j+1}).
Since v(b q_j) = v(b) + v(q_j), we have v(b) + v(q_j) ≥ v(q_i) + v(q_{j+1}), and
since v(q_j) < v(q_{j+1}), it follows v(b) > v(q_i), i.e. b ≡ 0 (q_{i+1}), q.e.d.

(2) The conditions are sufficient. We introduce the following notations:
if ξ, η are elements of 𝔒, we write ξ ≤ η (or η ≥ ξ), if the congruence
ξ ≡ 0 (q_i) always implies η ≡ 0 (q_i); we write ξ < η (or η > ξ) if there exists
in the sequence {q_i} an ideal q_m such that η ≡ 0 (q_m), ξ ≢ 0 (q_m). We now
prove the following lemma, assuming that the condition of Theorem 2.1 or
that of Theorem 2.2 is satisfied.


Lemma. If ξ, η, ζ are elements of 𝔒 and if ξη ≡ 0 (q_i), then η ≤ ζ implies
ξζ ≡ 0 (q_i) and η < ζ implies ξζ ≡ 0 (q_{i+1}).

In other words: η ≤ ζ implies ξη ≤ ξζ; η < ζ implies that either ξη < ξζ
or that ξη and ξζ belong to all the ideals q_i of the sequence.

Assume the condition of Theorem 2.1, and let q_i : (ξ) = q_h. Since
η ≡ 0 (q_h), it follows that if η ≤ ζ, then also ζ ≡ 0 (q_h), whence ξζ ≡ 0 (q_i),
and this proves the first part of the lemma. Let q_{i+1} : (ξ) = q_s, s ≥ h, and
let φ and ψ be any pair of elements of q_h. We have φξ ≡ 0 (q_i), ψξ ≡ 0 (q_i),
whence there exist elements c, d in the underlying field f such that
cφξ + dψξ ≡ 0 (q_{i+1}), since q_i/q_{i+1} is of rank 1 with respect to f. Hence
cφ + dψ ≡ 0 (q_s), for any two elements φ, ψ in q_h and for appropriate elements
c, d in f, i.e. q_h/q_s is at most of rank 1 with respect to f, and s is either h
or h + 1. Since η ≡ 0 (q_h), it follows that if η < ζ, then ζ ≡ 0 (q_{h+1}) ≡ 0 (q_s),
whence ξζ ≡ 0 (q_{i+1}), q.e.d.

Assume the condition of Theorem 2.2. If η belongs to all the ideals q_i
of the sequence, the same will be true for ζ, and both parts of the lemma are
trivial. Assume that there exists a last ideal q_h which contains η: η ≡ 0 (q_h),
η ≢ 0 (q_{h+1}). If η ≤ ζ, then also ζ ≡ 0 (q_h), whence ηξζ ≡ 0 (q_i q_h). Assume,
if possible, ξζ ≢ 0 (q_i). There will then exist an ideal q_j in our sequence,
j < i, such that ξζ ≡ 0 (q_j), ξζ ≢ 0 (q_{j+1}). Since j + 1 ≤ i, q_i ≡ 0 (q_{j+1}),
the congruence ηξζ ≡ 0 (q_i q_h) implies ηξζ ≡ 0 (q_{j+1} q_h). Now q_{j+1} is a
maximal subideal of q_j and ξζ is in q_j but not in q_{j+1}; hence
q_j = (ξζ, q_{j+1}), and consequently, η q_j = (ηξζ, η q_{j+1}) ≡ 0 (q_{j+1} q_h),
since η ≡ 0 (q_h). It follows that η ≡ 0 (q_h q_{j+1} : q_j) ≡ 0 (q_{h+1}), in
contradiction with our hypothesis η ≢ 0 (q_{h+1}).
This proves the first part of the lemma.

Let now η < ζ and hence ζ ≡ 0 (q_{h+1}). We have then ηξζ ≡ 0 (q_i q_{h+1})
and also, by the first part of the lemma, just proved, ξζ ≡ 0 (q_i). Since q_{h+1}
is a maximal subideal of q_h and since η is in q_h but not in q_{h+1}, we have
q_h = (η, q_{h+1}), and consequently ξζ q_h = (ξζη, ξζ q_{h+1}) ≡ 0 (q_i q_{h+1}).
Hence ξζ ≡ 0 (q_i q_{h+1} : q_h) ≡ 0 (q_{i+1}), q.e.d.

The rest of the proof is based solely on the above Lemma. Let p_1 be the
intersection of the ideals q_i of our sequence. We prove that p_1 is a prime
ideal. We first observe that x and y cannot both belong to q_2, since
q_1 = p = (x, y). Let, for instance, x ≡ 0 (q_1), x ≢ 0 (q_2), whence x ≤ y. We
deduce from our lemma, that if a given power of x, say x^m, belongs to an ideal
q_i of the sequence, then also all the power products x^k y^l, k + l = m, belong
to q_i, i.e. p^m ≡ 0 (q_i). Since p/p^m is of finite rank with respect to the
field f, it follows that no power of x can belong to all the ideals q_i, i.e.
to p_1.

We next observe that all the ideals q_i, i ≥ 1, are primary ideals belonging
to p (= q_1). In fact: (1) q_i ≡ 0 (p). (2) Let ab ≡ 0 (q_i), a ≢ 0 (q_i),
whence a < ab. It is not possible to have b ≤ 1, because this would imply,
by our lemma, ab ≤ a, in contradiction with ab > a. Hence 1 < b and since
1 is in q_0, b must be in q_1, i.e. in p. (3) We have 1 < x, hence, by our Lemma,
x < x^2 (since no power of x belongs to p_1), whence, again by the Lemma,
x^2 < x^3, and generally, x < x^2 < ... < x^i. Consequently x^2 ≡ 0 (q_2),
x^3 ≡ 0 (q_3), ..., x^i ≡ 0 (q_i). The congruence x^i ≡ 0 (q_i) implies, as we have
just shown, the congruence p^i ≡ 0 (q_i).

To prove that p_1 is prime, let ξη ≡ 0 (p_1) and assume, if possible, that
ξ ≢ 0 (p_1), η ≢ 0 (p_1). Let q_r be the last ideal of the sequence {q_i} which
contains ξ and let similarly q_s be the last ideal containing η. If m and n are
the exponents of the primary ideals q_{r+1} and q_{s+1} respectively, we have
x^m ≡ 0 (q_{r+1}), x^n ≡ 0 (q_{s+1}), whence x^m > ξ, x^n > η and consequently,
by our lemma, x^{m+n} ≥ ξη, i.e. x^{m+n} ≡ 0 (p_1), and this is impossible. Hence
p_1 is prime, necessarily either the zero ideal or one-dimensional, since
p_1 ≡ 0 (p).

We now consider the well ordered descending set S of the ideals
q_{mn} = p_1^m q_n, where clearly q_{mn} ≡ 0 (q_{m_1 n_1}) if m > m_1 or if m = m_1
and n ≥ n_1. Here q_{0n} = q_n and if p_1 = (0) it is understood that S coincides
with the sequence {q_n}. In either case, the intersection of all the ideals
q_{mn} of the set is the zero ideal. Hence, given any element ξ in 𝔒, there will
exist a first ideal in the set S, say q_{ij}, which does not contain ξ. Let η be
another element in 𝔒 and let q_{i′j′} be the first ideal in the set which does
not contain η. We complete and modify our notations ξ ≤ η, ξ < η introduced
above, as follows: we write ξ ≤ η if q_{i′j′} ≡ 0 (q_{ij}), and ξ < η if
q_{i′j′} ≡ 0 (q_{ij}) and q_{i′j′} ≠ q_{ij}. It is obvious that if ξ ≤ η and
η ≤ ζ, then ξ ≤ ζ, and if ξ < η and η ≤ ζ, or if ξ ≤ η and η < ζ, then ξ < ζ.
From our Lemma and from the fact that p_1 is




a principal ideal, it follows in a straightforward manner that the relations
ξ ≤ η, ξ < η imply, for any element ζ in 𝔒, the relations ξζ ≤ ηζ, ξζ < ηζ
respectively. It is also evident that if ξ ≤ η, ξ ≤ η′, then ξ ≤ η + η′, and if
ξ < η and ξ < η′, then ξ < η + η′.

Let 𝔅 be the set of all elements in the field Σ which can be put in the
form η/ξ, ξ, η ∈ 𝔒, ξ ≤ η. We prove that 𝔅 is a valuation ring.

First, 𝔅 is a ring. In fact, let η/ξ and η_1/ξ_1 belong to 𝔅. Since ξ ≤ η and
ξ_1 ≤ η_1, we have ξξ_1 ≤ ηξ_1 and ξξ_1 ≤ ξη_1, whence ξξ_1 ≤ ηξ_1 + ξη_1, i.e.

          η/ξ + η_1/ξ_1 = (ηξ_1 + ξη_1)/(ξξ_1) belongs to 𝔅.

Also, since ξξ_1 ≤ ηη_1, the product (η/ξ)(η_1/ξ_1) belongs to 𝔅.

To prove that 𝔅 is a valuation ring, it is sufficient to show that given any
two elements a, b in Σ, then either a/b or b/a is in 𝔅 (?, p. 102). In other
words, we have to show that given any two elements ξ, η in 𝔒, either ξ/η or η/ξ
must belong to 𝔅. But this is obvious, since one of the two relations ξ ≤ η,
η ≤ ξ must hold true.

Finally, the valuation abstractly defined by the valuation ring 𝔅 is not
the trivial one, in which every element has value zero. In other words, the
ring 𝔅 does not contain all the elements of the field Σ. In fact, take two
elements ξ, η in 𝔒 such that, definitely, ξ < η. We assert that ξ/η does not
belong to 𝔅. Assuming the contrary, we must be able to put ξ/η in the form
η_1/ξ_1, where ξ_1, η_1 ∈ 𝔒 and ξ_1 ≤ η_1. By our lemma, ξ_1 ≤ η_1 implies
ηξ_1 ≤ ηη_1, while ξ < η implies ξξ_1 < ηξ_1, i.e. ηη_1 < ηξ_1, giving two
contradictory relations.

It remains to show that the ideals q_{mn} of our well ordered set S are the
v-ideals belonging to the valuation B defined by the valuation ring 𝔅.
Consider any ideal q_{mn} and let ξ, η be elements in 𝔒 such that ξ ≡ 0 (q_{mn}),
v(η) ≥ v(ξ). Since v(η/ξ) ≥ 0, we must have η/ξ = η_1/ξ_1, where ξ_1 ≤ η_1.
Hence ξξ_1 ≤ ξη_1 = ηξ_1, i.e. ξξ_1 ≤ ηξ_1, and this implies ξ ≤ η. Hence
η ≡ 0 (q_{mn}), and thus q_{mn} enjoys the property: ξ ≡ 0 (q_{mn}), v(η) ≥ v(ξ)
implies η ≡ 0 (q_{mn}). Hence q_{mn} is a v-ideal belonging to the valuation B.
That the set S = {q_{mn}} contains all the v-ideals for B, is implied by the fact
that each q_{mn} (n ≠ 0) is a maximal subideal of its immediate predecessor
q_{m,n−1}, and that q_{m0} is the intersection of the ideals which precede it in S.


3. Further properties of v-ideals. We consider the sequence {q_i} of
0-dimensional v-ideals in 𝔒 belonging to a fixed valuation B of Σ, where
q_1 = p = (x, y) and q_0 = 𝔒. We may assume x ≢ 0 (q_2). Since q_1/q_2 is
of rank 1 with respect to the field f, there exist elements c, d in f such that
cx + dy ≡ 0 (q_2), d ≠ 0. We then replace y by cx + dy and thus we may
assume y ≡ 0 (q_2), whence q_2 = (y, x^2).




Let p^h be the highest power of p which divides a given v-ideal q_i:
q_i ≡ 0 (p^h), q_i ≢ 0 (p^{h+1}). Every polynomial f in q_i is then of the form
f = f_h + f_{h+1} + ..., where f_j is homogeneous of degree j in x and y. We
shall call f_h the subform of f (for particular polynomials in q_i, f_h may be
identically zero). As f varies in q_i, its subform f_h generates a linear system
of forms, of a certain dimension r ≥ 0 (a module of rank r + 1 over the
field f). Denote this system by Ω(q_i). We define in a similar manner the
symbol Ω(𝔄) for any ideal 𝔄 in 𝔒.


THEOREM 3. Let q_i ≡ 0 (p^h), q_i ≢ 0 (p^{h+1}) and let 𝔄 = [q_i, p^k], k ≥ h.
If Ω(𝔄) is of dimension r, then Ω(𝔄) coincides with the system of forms
(of degree k) which are divisible by y^{k−r}, and, moreover, there exists a
v-ideal q_j in the sequence {q_n} such that 𝔄 = p^r q_j.


Proof. Since q_i contains polynomials φ whose subforms are exactly of
degree h, it also contains polynomials (such as x^{k−h}φ) whose subforms are of
degree k. Hence 𝔄 ≢ 0 (p^{k+1}), and Ω(𝔄) consists of forms of degree k. Let
f = f_k + f_{k+1} + ... be a polynomial belonging to 𝔄 such that f_k ≠ 0, and let
f_k = y^ρ ψ_{k−ρ}, ψ_{k−ρ} ≢ 0 (y). Since y^ρ and ψ_{k−ρ} are relatively prime,
every form g_m, of degree m ≥ k − 1, can be expressed as a linear combination
A ψ_{k−ρ} + B y^ρ, where A and B are forms of degree m − k + ρ and m − ρ
respectively. It follows that given any integer n ≥ 0, it is possible to find
two polynomials P^{(n)}(x, y), Q^{(n)}(x, y) of the form

(3)    P^{(n)}(x, y) = y^ρ + A_{ρ+1}(x, y) + ... + A_{ρ+n}(x, y),
       Q^{(n)}(x, y) = ψ_{k−ρ} + B_{k−ρ+1}(x, y) + ... + B_{k−ρ+n}(x, y),

where A_i, B_i are forms of degree i, in such a manner as to have

(4)    f ≡ P^{(n)} Q^{(n)}  (p^{k+n+1}).

In fact, we have for the unknown forms A_i, B_i the equations

       A_{ρ+i} ψ_{k−ρ} + B_{k−ρ+i} y^ρ
            = f_{k+i} − Σ_{j=1}^{i−1} A_{ρ+j} B_{k−ρ+i−j},   (i = 1, 2, ..., n),

and these equations can be solved successively for A_{ρ+1}, B_{k−ρ+1}; A_{ρ+2},
etc. We take n sufficiently high, so as to have p^{k+n+1} ≡ 0 (𝔄).* For such a
value of n we will have

(5)    P^{(n)} Q^{(n)} ≡ 0 (𝔄).


* Since 𝔄 = [q_i, p^k] and q_i is a primary ideal belonging to p, also 𝔄 is a
primary ideal belonging to p.




This implies P^{(n)} Q^{(n)} ≡ 0 (q_i). Now the subform ψ_{k−ρ} of Q^{(n)} is not
divisible by y and, by our choice of the variables x, y, we have v(y) > v(x).
Hence v(Q^{(n)}) = v(x^{k−ρ}), and consequently, if φ_{k−ρ} is any form of degree
k − ρ in x, y, we have v(φ_{k−ρ}) ≥ v(Q^{(n)}). Consequently,
v(P^{(n)} φ_{k−ρ}) ≥ v(P^{(n)} Q^{(n)}), and, since P^{(n)} Q^{(n)} ≡ 0 (q_i), also
P^{(n)} φ_{k−ρ} ≡ 0 (q_i). This congruence holds true for any form φ_{k−ρ} of
degree k − ρ, whence P^{(n)} p^{k−ρ} ≡ 0 (q_i). Since the subform of
P^{(n)} φ_{k−ρ} is of degree k, we have P^{(n)} φ_{k−ρ} ≡ 0 (p^k), consequently,
since 𝔄 = [q_i, p^k],

(6)    P^{(n)} φ_{k−ρ} ≡ 0 (𝔄)
and
(6′)   P^{(n)} p^{k−ρ} ≡ 0 (𝔄).

We have then, in view of (6), that if 𝔄 contains a polynomial f whose subform
f_k is divisible by y^ρ but not by y^{ρ+1}, 𝔄 also contains polynomials whose
subforms are of degree k and are arbitrarily assigned forms divisible by y^ρ.
This shows that Ω(𝔄) consists of all the forms which are divisible by a certain
power of y, say y^ρ, where necessarily ρ = k − r, if r is the dimension of Ω(𝔄).
This proves the first part of the theorem.

Let 𝔄′ = 𝔄 : p^r and let 𝔄′ ~ q_j, i.e. let q_j be the v-ideal such that
v(𝔄′) = v(q_j). There exists such an ideal q_j in the sequence {q_n}, since 𝔄,
a primary 0-dimensional ideal, cannot belong to all the ideals q_n (whose
intersection p_1 is at least one-dimensional) and since 𝔄 ≡ 0 (𝔄′). We have
𝔄′p^r ≡ 0 (𝔄) ≡ 0 (q_i), whence q_j p^r ≡ 0 (q_i), since
v(q_j p^r) = v(𝔄′p^r) ≥ v(q_i). We assert that q_j p^r is also contained in the
ideal p^k, i.e. q_j is contained in p^{k−r}. In fact, assume the contrary. There
will then exist in q_j a polynomial F = F_σ + F_{σ+1} + ..., whose subform F_σ
is of degree σ < k − r, i.e. σ < ρ. The polynomial x^{k−σ}F belongs to q_i,
since q_j p^r ≡ 0 (q_i) and k − σ > r. It also belongs to p^k. Hence
x^{k−σ}F ≡ 0 (𝔄), and this is impossible, since the subform x^{k−σ}F_σ of
x^{k−σ}F is of degree k and is at most divisible by y^σ, σ < ρ. As a result, we
have q_j p^r ≡ 0 (q_i) and q_j p^r ≡ 0 (p^k), whence

(7)    q_j p^r ≡ 0 (𝔄).


On the other hand, let f be any polynomial belonging to 𝔄, and let us
first assume that its subform f_k is not divisible by y^{ρ+1}, f_k = y^ρ ψ_{k−ρ},
ψ_{k−ρ} ≢ 0 (y). As above, we determine the polynomials P^{(n)} and Q^{(n)}, given
by (3), so as to satisfy (4), and we again choose n sufficiently high so that
p^{k+n+1} ≡ 0 (𝔄). Then the congruence (5) holds true and consequently also
(6′), whence P^{(n)} ≡ 0 (𝔄′) ≡ 0 (q_j), since k − ρ = r. Moreover, if n is
sufficiently high, we will also have p^{k+n+1} ≡ 0 (q_j p^r). For such a value
of n we deduce immediately from (4)




that f ≡ 0 (q_j p^r), since we have just seen that P^{(n)} belongs to q_j and
since Q^{(n)} ≡ 0 (p^r). Thus, we have shown that every polynomial f in 𝔄 belongs
to q_j p^r, provided the subform of f is not divisible by y^{ρ+1}. If the subform
f̄_k of a polynomial f̄ in 𝔄 is divisible by y^{ρ+1}, we consider in 𝔄 a
polynomial f = f_k + ..., such that f_k ≢ 0 (y^{ρ+1}). By the preceding result,
we have f ≡ 0 (q_j p^r) and also f + f̄ ≡ 0 (q_j p^r), whence again
f̄ ≡ 0 (q_j p^r). It is therefore proved that

          𝔄 ≡ 0 (q_j p^r),

and comparing with (7), we deduce 𝔄 = q_j p^r, and this proves our theorem.
The following consequences can be drawn from Theorem 3:


COROLLARY 3.1. 𝔄 cannot admit a factor p^σ with σ > r (r = k − ρ),
i.e., if 𝔄 = p^σ 𝔄_1 is a product representation of 𝔄, then σ ≤ r. In fact, the
subforms of 𝔄 form then a system Ω(𝔄) of dimension ≥ σ.


COROLLARY 3.2 (special case k = h). If q_i ≡ 0 (p^h), q_i ≢ 0 (p^{h+1}) and
if Ω(q_i) is of dimension r, then q_i = p^r q_j, where q_j is an ideal in the
sequence {q_n}, and q_i does not admit as a factor a higher power of p than p^r.
In particular, if r = 0, then q_i does not admit factors p and every element of
Ω(q_i) coincides, to within a constant factor in the field f, with y^h.
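Corollary 3.2 can be tried out on a small example. The sketch below assumes, for illustration only, the monomial valuation v(x) = 1, v(y) = √2 (a choice of ours, not made in the text), for which all v-ideals are monomial ideals. For each of the first few ideals q_i it reads off h and the dimension r of Ω(q_i) from the generators of least degree, and then searches the sequence for a q_j with q_i = p^r q_j. The helper names are hypothetical.

```python
import math

SQRT2 = math.sqrt(2.0)
BOUND = 12  # search window for exponents; an assumption adequate for this demo

def minimalize(monomials):
    """Minimal generators of the monomial ideal spanned by `monomials`."""
    ms = set(monomials)
    return sorted(m for m in ms
                  if not any(n != m and n[0] <= m[0] and n[1] <= m[1] for n in ms))

def jordan_sequence(count):
    """First `count` v-ideals for the assumed valuation v(x) = 1, v(y) = sqrt(2)."""
    grid = [(i, j) for i in range(BOUND) for j in range(BOUND)]
    val = lambda m: m[0] + m[1] * SQRT2
    thresholds = sorted(val(m) for m in grid)[:count]
    return [minimalize(m for m in grid if val(m) >= t - 1e-9) for t in thresholds]

def product(a, b):
    """Product of two monomial ideals given by generators."""
    return minimalize((s[0] + t[0], s[1] + t[1]) for s in a for t in b)

def power_of_p(r):
    """Generators of p^r, where p = (x, y)."""
    result = [(0, 0)]
    for _ in range(r):
        result = product(result, [(0, 1), (1, 0)])
    return result

def subform_data(gens):
    """Return (h, r): least degree h, and r = (number of degree-h generators) - 1."""
    h = min(i + j for i, j in gens)
    return h, sum(1 for i, j in gens if i + j == h) - 1

def factor(seq, i):
    """Return (r, j) with q_i = p^r q_j, or None if no such j is found in `seq`."""
    _, r = subform_data(seq[i])
    pr = power_of_p(r)
    for j, qj in enumerate(seq):
        if product(pr, qj) == seq[i]:
            return r, j
    return None

seq = jordan_sequence(10)
for i in range(1, 10):
    print(i, seq[i], subform_data(seq[i]), factor(seq, i))
```

For instance q_4 = (xy, y^2, x^3) has h = 2 and subforms spanned by xy and y^2, so r = 1, and indeed q_4 = p q_2 with q_2 = (y, x^2). For a monomial ideal the forms of degree h in the ideal are exactly its generators of degree h, which is why r can be read off by counting.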


COROLLARY 3.3. If q_i ≡ 0 (p^h), q_i ≢ 0 (p^{h+1}) and if p^k q_i ~ q_m (k ≥ 0),
then
          [q_m, p^{h+k}] = p^k q_i.


In fact, let 𝔄 = [q_m, p^{h+k}]. We have q_m ≢ 0 (p^{h+k+1}), hence, by Theorem 3,
𝔄 = p^r q_j, where q_j is some v-ideal of our sequence {q_n} and r is the dimension
of Ω(𝔄). Since p^k q_i ~ q_m, we have p^k q_i ≡ 0 (q_m) and also
p^k q_i ≡ 0 (p^{h+k}), since q_i ≡ 0 (p^h). Consequently
p^k q_i ≡ 0 (𝔄) ≡ 0 (p^r q_j). Since the dimension of Ω(p^k q_i) is at least k
and since Ω(p^k q_i) is a subset of Ω(𝔄) (in view of the assumption
q_i ≢ 0 (p^{h+1})), it follows k ≤ r. Now, q_m ~ p^k q_i,

          v(p^k q_i) = v(q_m) ≤ v(𝔄) = v(p^r q_j), i.e. v(q_i) ≤ v(p^{r−k} q_j),

whence p^{r−k} q_j ≡ 0 (q_i) and p^r q_j ≡ 0 (p^k q_i). Since we also have
p^k q_i ≡ 0 (p^r q_j), it follows that 𝔄 = p^r q_j = p^k q_i, q.e.d.


4. v-ideals and quadratic transformations. We consider the quadratic
transformation T:

          x′ = x,   y′ = y/x,



having at x = y = 0 a fundamental point, and we denote by 𝔒′ the ring of
polynomials in x′, y′. 𝔒 is a subring of 𝔒′, and moreover 𝔒′ is contained in
the valuation ring of the given valuation B, since we have assumed v(y) > v(x),
whence v(y′) > 0. Let {q′_j} be the sequence of 0-dimensional v-ideals in 𝔒′,
where q′_1 = p′ = (x′, y′). We wish to study the connection between the v-ideals
q_i in 𝔒 and the v-ideals q′_j in 𝔒′. Note that q_1 = p = (x, y), q_2 = (y, x^2),
whence 𝔒′p = (x′, x′y′) = (x′) and 𝔒′q_2 = (x′y′, x′^2) = x′p′.


THEOREM 4.1. The extended ideal 𝔒′q of an ideal q in the sequence {q_i} is
of the form x′^h q′, where q′ is an ideal in the sequence {q′_j} and where
q ≡ 0 (p^h), q ≢ 0 (p^{h+1}). Moreover, q is the contracted ideal of x′^h q′.
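The statement of Theorem 4.1 can be followed on monomial ideals, where the substitution x = x′, y = x′y′ sends x^i y^j to x′^{i+j} y′^j. The sketch below again assumes the illustrative monomial valuation v(x) = 1, v(y) = √2 (our choice, not the text's), so that v(x′) = 1 and v(y′) = √2 − 1. It extends a v-ideal q of 𝔒 to 𝔒′, splits off the factor x′^h, and checks that the remaining ideal q′ is a v-ideal of 𝔒′. The function names are hypothetical.

```python
import math

SQRT2 = math.sqrt(2.0)
BOUND = 12  # search window for exponents; an assumption adequate for this demo

VX, VY = 1.0, SQRT2        # assumed values of x and y
VX1, VY1 = VX, VY - VX     # induced values of x' = x and y' = y/x

def minimalize(monomials):
    """Minimal generators of the monomial ideal spanned by `monomials`."""
    ms = set(monomials)
    return sorted(m for m in ms
                  if not any(n != m and n[0] <= m[0] and n[1] <= m[1] for n in ms))

def v_ideal(threshold, vx, vy):
    """Generators of {f : v(f) >= threshold} for the monomial valuation (vx, vy)."""
    return minimalize((i, j) for i in range(BOUND) for j in range(BOUND)
                      if i * vx + j * vy >= threshold - 1e-9)

def transform(gens):
    """Apply x = x', y = x'y' to a monomial ideal; return (h, generators of q')."""
    h = min(i + j for i, j in gens)
    return h, minimalize((i + j - h, j) for i, j in gens)

def check(threshold):
    """Return (h, q', whether q' is the v-ideal of its own least value in O')."""
    q = v_ideal(threshold, VX, VY)
    h, q_prime = transform(q)
    low = min(i * VX1 + j * VY1 for i, j in q_prime)
    return h, q_prime, q_prime == v_ideal(low, VX1, VY1)

print(check(SQRT2))       # q_2 = (y, x^2)
print(check(2 * SQRT2))   # q_5 = (y^2, x^2 y, x^3)
```

For q_2 = (y, x^2) the extended ideal is x′(y′, x′) = x′p′, as noted in the text, and for q_5 = (y^2, x^2 y, x^3) it is x′^2 (y′^2, x′), whose second factor is again a v-ideal for the transformed values.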

Proof. If f(x, y) = f_σ(x, y) + f_{σ+1}(x, y) + ... is a polynomial in x, y
and f_σ(x, y) is its subform, then

          f(x, y) = f(x′, x′y′) = x′^σ [f_σ(1, y′) + x′ f_{σ+1}(1, y′) + ...],

whence, considering f as an element of 𝔒′, we have f ≡ 0 (x′^σ), f ≢ 0 (x′^{σ+1}).
By hypothesis, the subform of any polynomial belonging to q is of degree ≥ h
and q contains a polynomial whose subform is exactly of degree h. Hence
𝔒′q = x′^h q′, where q′ ≢ 0 (x′).

We show that q′ is a v-ideal. We have

(8)    v(q′) = v(q) − h v(x).

Let ω be an element of 𝔒′ such that v(ω) ≥ v(q′), and let

          ω = F(x′, y′) = F(x, y/x) = G(x, y)/x^σ,

where F and G are polynomials and G(x, y) ≡ 0 (p^σ) (since clearly the
subform of G is of degree ≥ σ). Since v(ω) ≥ v(q′) it follows, by (8),
(h − σ) v(x) + v(G) ≥ v(q). Let k be a non-negative integer such that

          h_1 = h − σ + k ≥ 0.

We will have then

(9)    v(x^{h_1} G) ≥ v(x^k q) = v(p^k q).


Let p^k q ~ q_m, where q_m belongs to the sequence {q_i} of v-ideals in 𝔒. The
inequality (9) implies x^{h_1}G ≡ 0 (q_m), and since
x^{h_1}G ≡ 0 (p^{h_1+σ}) ≡ 0 (p^{h+k}), we have

          x^{h_1}G ≡ 0 ([q_m, p^{h+k}]).

By Corollary 3.3 of the preceding section it follows that

          x^{h_1}G ≡ 0 (q p^k),




whence x^{h_1}G is of the form

          x^{h_1}G = Σ A_i(x, y) B_i(x, y),

where A_i ≡ 0 (q), B_i ≡ 0 (p^k). But then B_i/x^k is a polynomial in x′, y′, and
putting B′_i = B_i/x^k we have

          x^{h−σ}G = Σ A_i B′_i ≡ 0 (𝔒′q) ≡ 0 (x′^h q′),

whence ω = G/x^σ ≡ 0 (q′). We have thus proved that the ideal q′ enjoys the
property which characterizes the v-ideals: if v(ω) ≥ v(q′), then ω ≡ 0 (q′).

It remains to prove that q′ belongs to the sequence {q′_j}. But this is
obvious, since q′ is necessarily a 0-dimensional ideal * or the unit ideal. That
the contracted ideal q̄ of x′^h q′ coincides with q follows immediately from the
fact that v(q̄) = v(x′^h q′) = v(q), whence q̄ ≡ 0 (q), while on the other hand
we must have, of course, q ≡ 0 (q̄). The theorem is thus proved.

The next theorem is in a sense the converse of the preceding theorem. 


THEOREM 4.2. For any ideal q′ in the sequence {q′_j} there exists an
integer h such that x′^h q′ is the extended ideal of an ideal q belonging to the
sequence {q_i}.


Proof. If φ_1(x′, y′), φ_2(x′, y′), ..., φ_s(x′, y′) is a base of q′ and if we
write these polynomials in the form of quotients: φ_i(x′, y′) = ψ_i(x, y)/x^m,
with a common denominator x^m, where ψ_1, ..., ψ_s are polynomials in x, y,
then we see that x′^m q′ is the extended ideal of the ideal (ψ_1, ..., ψ_s) in 𝔒.
Thus, there exist integers m such that x′^m q′ is an extended ideal of an ideal
in 𝔒. This will be true for all sufficiently high integers m, since if
𝔒′𝔄 = x′^m q′, then 𝔒′p𝔄 = x′^{m+1} q′. Let h be the smallest possible value of m,
and let q be the contracted ideal of x′^h q′:

(10)   q = [𝔒, x′^h q′],   𝔒′q = x′^h q′.

Since the contracted ideal of (x′^h) is p^h, it follows q ≡ 0 (p^h). Moreover,
since the primary ideal q′ belongs to the prime ideal p′ = (x′, y′), we have
x′^n ≡ 0 (x′^h q′), if n is sufficiently high. Passing to the contracted ideals
we find p^n ≡ 0 (q). Hence there exists an ideal q_n in the sequence {q_i} of
v-ideals in 𝔒 such that v(q_n) = v(q). Let p^σ be the highest power of p which
divides


* If n is a sufficiently high integer, then x^{n+h} ≡ 0 (q), whence
x′^n ≡ 0 (q′). Let f(x, y) be a polynomial in q whose subform is f_h(x, y),
f_h ≠ 0. If f = f_h + f_{h+1} + ..., then the polynomial
f_h(1, y′) + x′ f_{h+1}(1, y′) + ... belongs to q′. It follows that also
[f_h(1, y′)]^n belongs to q′, hence (x′^n, [f_h(1, y′)]^n) ≡ 0 (q′), and
consequently q′ is 0-dimensional, or is the unit ideal, since f_h(1, y′) ≠ 0.




q_n: q_n ≡ 0 (p^σ), q_n ≢ 0 (p^{σ+1}). We have q ≡ 0 (p^h) and q ≢ 0 (p^{h+1}),
consequently σ ≤ h, because q ≡ 0 (q_n). Let 𝔒′q_n = x′^σ q′_j, where q′_j, by the
preceding theorem, is a v-ideal in 𝔒′. Since q_n ~ q, we have
v(x′^σ q′_j) = v(x′^h q′). If σ = h, then v(q′_j) = v(q′), whence q′_j = q′, since
both are v-ideals. In this case q_n and q must coincide, since both are
contracted ideals of x′^h q′, and the theorem is proved.

Assume σ < h. We have

(11)   q ≡ 0 ([q_n, p^h]).

If f(x, y) is any polynomial belonging to [q_n, p^h], then f = x′^h f′(x′, y′) and
f ≡ 0 (x′^σ q′_j). Since v(x′^σ q′_j) = v(x′^h q′), it follows that
f′(x′, y′) ≡ 0 (q′). Hence f ≡ 0 (x′^h q′) and consequently, by (10), f ≡ 0 (q).
We have therefore [q_n, p^h] ≡ 0 (q), and, by (11), it follows

(12)   q = [q_n, p^h].

We have q_n ≡ 0 (p^σ), q_n ≢ 0 (p^{σ+1}) and h > σ. We can then apply Theorem 3
and we obtain q = p^r q_α, where q_α is again a v-ideal in 𝔒. Here r is the
dimension of Ω(q) and consequently r > 0, because p^{h−σ} q_n ≡ 0 (q), whence
r ≥ h − σ > 0. We have then

          𝔒′q = x′^r 𝔒′q_α = x′^h q′,

whence 𝔒′q_α = x′^{h−r} q′. This contradicts our hypothesis that h is the smallest
integer such that x′^h q′ is an extended ideal. Hence σ < h is impossible, and
the theorem is proved.

If 𝔄 is any ideal in 𝔒 and if 𝔒′𝔄 = x′^h 𝔄′, 𝔄′ ≢ 0 (x′), we shall call
𝔄′ the transformed ideal of 𝔄 (under the quadratic transformation T), in
symbols: 𝔄′ = T(𝔄). By Theorems 4.1 and 4.2, the transform of any v-ideal
q in 𝔒 is a v-ideal q′ in 𝔒′, and every v-ideal q′ in 𝔒′ is the transform of at
least one v-ideal q in 𝔒, everything referred to a fixed valuation. There may
be more than one v-ideal in 𝔒 whose transform is q′. To find them, we again
consider the smallest integer h such that x′^h q′ is an extended ideal. It follows
from the preceding proof that if q is the contracted ideal of x′^h q′, then
T(q) = q′, and from Theorem 4.1 it follows that any other v-ideal q̄ in 𝔒
such that T(q̄) = q′ must be the contracted ideal of x′^σ q′, where σ is some
integer greater than h. For a given integer σ greater than h, the contracted
ideal of x′^σ q′ may or may not be a v-ideal in 𝔒, but let us at any rate examine
this contracted ideal. Let us denote it by 𝔄_σ. Since x′^σ q′ is the extended
ideal of 𝔄_σ and also of p^{σ−h} q, it follows

(13)   p^{σ−h} q ≡ 0 (𝔄_σ).




For the same reason we have v(𝔄_σ) = v(p^{σ−h} q). Let q_m be the v-ideal
equivalent to both 𝔄_σ and p^{σ−h} q, q_m ~ 𝔄_σ ~ p^{σ−h} q. By Theorem 3,
Corollary 3.3, we have p^{σ−h} q = [q_m, p^σ]. Now, q_m ~ 𝔄_σ implies
𝔄_σ ≡ 0 (q_m), and since 𝔒′𝔄_σ = x′^σ q′, we also have 𝔄_σ ≡ 0 (p^σ).
Consequently 𝔄_σ ≡ 0 (p^{σ−h} q), and hence, by (13), 𝔄_σ = p^{σ−h} q. We
therefore can state the following theorem.


THEOREM 4.3. If q′ is a v-ideal in 𝔒′, belonging to the sequence {q′_j},
and if h is the smallest integer such that x′^h q′ is an extended ideal of an ideal
in 𝔒, then the contracted ideal q of x′^h q′ is a v-ideal in 𝔒, a member of the
sequence {q_i}, and T(q) = q′. If σ is any integer ≥ h, the contracted ideal
of x′^σ q′ is p^{σ−h} q, but need not be a v-ideal. In particular, the v-ideals
whose transform is the given v-ideal q′ are all of the form p^{σ−h} q, σ ≥ h.


We shall regard q as the transform of q′ by T^{-1}: q = T^{-1}(q′). By the
definition of q, the system Ω(q) of the subforms of q must be of dimension
r = 0, i.e. every form in Ω(q) differs from y^h by a factor c, c in the field f.
In fact, if it were r > 0, then q could be put in the form q = p^r q̄, where q̄ is
also a v-ideal (Corollary 3.2), and we would have 𝔒′q̄ = x′^{h−r} q′, contrary to
our assumption that h is the smallest integer such that x′^h q′ is an extended
ideal. Conversely, if q is a v-ideal in 𝔒, belonging to the sequence {q_i}, and
if Ω(q) is of dimension zero, then q is so related to its transform q′ = T(q),
that q = T^{-1}(q′), i.e. if 𝔒′q = x′^h q′, then h is the smallest integer such
that x′^h q′ is an extended ideal. In fact, in the contrary case, q could be put
in the form p^r q̄, r > 0, (by Theorem 4.3), contrary to the hypothesis that Ω(q)
is of dimension zero. Thus, there is a one to one correspondence between the
ideals q′ of the sequence {q′_j} and those ideals q of the sequence {q_i} whose
system Ω(q) of subforms is of dimension zero: to each q′ there corresponds a
unique ideal q = T^{-1}(q′), and q′ = T(q).

Let q′_α, q′_β be two distinct v-ideals in 𝔒′, and let q_i = T^{-1}(q′_α),
q_j = T^{-1}(q′_β), be their transforms in 𝔒. Suppose that i < j, whence
q_j ≡ 0 (q_i). We assert that in such a case also α < β. In fact, let
q_i ≡ 0 (p^h), q_i ≢ 0 (p^{h+1}) and let q_j ≡ 0 (p^σ), q_j ≢ 0 (p^{σ+1}), whence
𝔒′q_i = x′^h q′_α, 𝔒′q_j = x′^σ q′_β and clearly, σ ≥ h. Evidently
x′^σ q′_β ≡ 0 (x′^h q′_α). Supposing that α > β, whence q′_α ≡ 0 (q′_β), we would
have x′^σ q′_α ≡ 0 (x′^σ q′_β), and passing to the contracted ideals in 𝔒, we
would get by Theorem 4.3 p^{σ−h} q_i ≡ 0 (q_j). The equality σ = h is excluded,
because q_j ≡ 0 (q_i) and q_j ≠ q_i. Hence σ > h, but then the congruence
p^{σ−h} q_i ≡ 0 (q_j) is in contradiction with the fact that the system Ω(q_j)
(consisting of forms of degree σ) is of dimension zero. Hence our assumption
α > β leads to a contradiction, and consequently it is proved that i < j implies
α < β. If then T^{-1}(q′_j) = q_{α_j}, the indices α_j form an ascending




sequence, α_1 < α_2 < .... It is immediately verified that α_0 = 0, α_1 = 2.
Hence α_j > j, if j > 0. Let now q_s be any v-ideal in the sequence {q_i} and
let T(q_s) = q′_σ. By Theorem 4.3 we have q_s = p^ρ q_{α_σ}, ρ ≥ 0, whence
q_s ≡ 0 (q_{α_σ}), and s ≥ α_σ, i.e. s > σ. Observing that the length of the
ideal q_s is equal to s, we can summarize the preceding results in the
following theorem:


THEOREM 4.4. If T^{-1}(q′_j) = q_{α_j}, then α_j > j if j ≥ 1. Moreover, if
q_s is any ideal in the sequence {q_i} and if q′_σ = T(q_s), then s > σ, i.e.
length of q_s > length of q′_σ.


5. Simple and composite v-ideals. We say that an ideal 𝔄 in 𝔒 is
simple, if 𝔄 cannot be represented as the product of two ideals, both different
from the unit ideal, i.e. if 𝔄 = 𝔅ℭ, 𝔅 ≠ (1) implies ℭ = (1). An ideal is
composite if it is not simple.


THEOREM 5.1. A composite v-ideal can be represented as a product of 
v-ideals different from the unit ideal. 


Proof. Let 𝔄 be a v-ideal in 𝔒, belonging to some valuation B, and let
𝔄 = 𝔅ℭ, 𝔅 ≠ (1), ℭ ≠ (1). Let 𝔅_1, ℭ_1 be the v-ideals belonging to B such
that 𝔅 ~ 𝔅_1, ℭ ~ ℭ_1. Since 𝔅 ≡ 0 (𝔅_1), ℭ ≡ 0 (ℭ_1), we have 𝔄 ≡ 0 (𝔅_1ℭ_1).
On the other hand, v(𝔄) = v(𝔅) + v(ℭ) = v(𝔅_1) + v(ℭ_1) = v(𝔅_1ℭ_1),
whence 𝔅_1ℭ_1 ≡ 0 (𝔄), since 𝔄 is a v-ideal. We conclude that 𝔄 = 𝔅_1ℭ_1, and
it remains to prove that 𝔅_1 ≠ (1) and ℭ_1 ≠ (1). Assume the contrary, and
let, for instance, 𝔅_1 = (1). Then 𝔄 = ℭ_1, whence ℭ ≡ 0 (𝔄), and consequently
ℭ = 𝔄, since 𝔄 = 𝔅ℭ ≡ 0 (ℭ). We have then 𝔄 = 𝔅𝔄, and this implies
(*, p. 36), 𝔅 = (1), contrary to hypothesis.

Consider the given valuation B and the corresponding sequences {q_i},
{q′_j} of 0-dimensional v-ideals in the polynomial rings 𝔒 = f[x, y],
𝔒′ = f[x′, y′] respectively. Let, as before, T^{-1}(q′_j) = q_{α_j}. It is clear
that all the simple v-ideals of the sequence {q_i}, except the ideal p = q_1,
belong to the sequence {q_{α_j}}, since any ideal q_i (i ≠ 1), not in the
sequence {q_{α_j}}, is, by Theorem 4.3, either of the form p^ρ q_{α_j}, ρ > 0,
q_{α_j} ≠ (1), or of the form p^ρ, ρ > 1. Now suppose that q_{α_j}, for a given j,
is a composite ideal. Then we can write, by Theorem 5.1, q_{α_j} = q_s q_t, where
q_s, q_t are in the sequence {q_i}. Hence q′_j = T(q_{α_j}) = q′_σ q′_τ, where
q′_σ = T(q_s) and q′_τ = T(q_t). We have seen above that Ω(q_{α_j}) is of
dimension zero; therefore neither q_s nor q_t can be a power of p. It follows
that q′_σ ≠ (1) and q′_τ ≠ (1), i.e. q′_j is composite. We conclude then that if
q′_j is a simple v-ideal in 𝔒′, then its transform q_{α_j} = T^{-1}(q′_j) is also
a simple v-ideal. Much more difficult is to prove the converse:




THEOREM 5.2. The transform T(q_i) of a simple v-ideal q_i in 𝔒 is a
simple v-ideal (in 𝔒′).


The proof of this theorem will be given in Part II of the paper (Corollary 
11.2), where we shall characterize the simple v-ideals from the point of view 
of formal power series. Here we shall use this theorem without proof. 

Let P_1, P_2, ..., P_i, ... be the sequence of simple v-ideals, different
from (1), as they occur in the sequence {q_i}:

(14)   P_1, P_2, ..., P_i, ...;   P_1 = q_1 = (x, y),   P_2 = q_2 = (y, x^2).

Let similarly P′_1, P′_2, ... be the sequence of simple v-ideals, different from
(1), as they occur in the sequence {q′_ν}. By the preceding results, especially
by Theorem 5.2, there is a (1,1) correspondence between the ideals P_i, i > 1,
and the ideals P′_α, where to P_i corresponds T(P_i) = P′_α and P_i = T^{-1}(P′_α).
Moreover, by Theorem 4.4, if T(P_i) = P′_α and T(P_j) = P′_β, then i < j
implies α < β. Since T(P_2) = P′_1, we conclude with the following theorem:


THEOREM 5.3. T(P_i) = P′_{i−1}, i.e. the transform of the simple v-ideal
P_i by the quadratic transformation T is the simple v-ideal P′_{i−1}.


As a consequence of this theorem, it follows incidentally that the sequence
{q_n} contains infinitely many simple v-ideals. In fact, if we assume that any
sequence {q_n} of v-ideals in any polynomial ring contains always at least k > 0
simple v-ideals (it always contains at least one, namely the prime ideal
q_1 = p = (x, y)), and if we apply this assumption to the sequence {q′_ν} of
v-ideals in the polynomial ring 𝔒′, we deduce immediately, by Theorem 5.3,
that any sequence {q_n} contains at least k + 1 simple v-ideals.


6. Properties of simple v-ideals. Heretofore we have been dealing with
a fixed valuation $B$ and with the v-ideals in $\mathfrak{O}$ belonging to $B$. Now, a v-ideal
belonging to $B$ may also occur as a v-ideal for many other valuations. Consider,
in particular, the $i$-th simple v-ideal $\mathscr{P}_i$ for $B$, and let $\bar{B}$ be another
valuation for which $\mathscr{P}_i$ is a v-ideal. We assert that $\mathscr{P}_i$ is also the $i$-th simple
v-ideal for $\bar{B}$. The assertion is trivial for $i = 1$, because $\mathscr{P}_1 = \mathfrak{p} = (x, y)$.
We may then proceed by induction, assuming that our assertion is true for
$i - 1$. Let $\{\mathfrak{q}_n\}$ and $\{\bar{\mathfrak{q}}_n\}$ be the sequences of v-ideals in $\mathfrak{O}$ for the valuations
$B$ and $\bar{B}$ respectively. Since $\mathscr{P}_i$ occurs in both sequences, and since the ideals
$\bar{\mathfrak{q}}_n$ and $\mathfrak{q}_n$ are primary ideals belonging to the prime 0-dimensional ideals $\bar{\mathfrak{q}}_1$
and $\mathfrak{q}_1$ respectively, it follows that $\mathfrak{q}_1 = \bar{\mathfrak{q}}_1 = \mathfrak{p} = (x, y)$. Furthermore, since
$\mathscr{P}_i$ is simple, its system $\Omega(\mathscr{P}_i)$ of subforms is of dimension 0. If $cx + dy$
is the base of $\Omega(\mathscr{P}_i)$, we must have $v(cx + dy) > v(\mathfrak{p})$ in $B$ and also




$v(cx + dy) > v(\mathfrak{p})$ in $\bar{B}$. We may therefore assume that $v(y) > v(x)$ in both
valuations $B$ and $\bar{B}$. We apply the quadratic transformation $T$: $x' = x$, $y' = y/x$
to both sequences $\{\mathfrak{q}_n\}$ and $\{\bar{\mathfrak{q}}_n\}$. The sequences $\{\mathfrak{q}'_\nu\}$ and $\{\bar{\mathfrak{q}}'_\nu\}$ of v-ideals in
$\mathfrak{O}' = \mathfrak{k}[x', y']$ belonging to the valuations $B$ and $\bar{B}$ respectively, will consist
of primary ideals belonging to the prime ideal $\mathfrak{p}' = (x', y')$. The transform
$\mathscr{P}'_{i-1}$ of $\mathscr{P}_i$ belongs to both sequences and is the $(i-1)$-th simple v-ideal in
the sequence $\{\mathfrak{q}'_\nu\}$. Hence, by our induction, $\mathscr{P}'_{i-1}$ is also the $(i-1)$-th
simple ideal in the sequence $\{\bar{\mathfrak{q}}'_\nu\}$. As a consequence, $\mathscr{P}_i$ must be the $i$-th
simple ideal in the sequence $\{\bar{\mathfrak{q}}_n\}$, and this proves our assertion.

Thus, given a simple v-ideal $\mathscr{P} = \mathscr{P}_i$ in $\mathfrak{O}$, there is uniquely determined
an integer $i$, such that $\mathscr{P}$ is the $i$-th simple v-ideal in the sequence of simple
v-ideals of any valuation for which $\mathscr{P}$ is a valuation ideal. We shall say that
$\mathscr{P}$ is a simple v-ideal of kind $i$, by analogy with the terminology of the
geometric theory of infinitely near points in the plane, where a point $O^{(i)}$,
infinitely near the point $O^{(1)} = (0,0)$, is said to be of kind $i$, if it is in the
$(i-1)$-th neighborhood of $O^{(1)}$. The identity of the two concepts will appear
from the formal power series considerations of Part II. However, already at
this stage, the analogy appears from the fact that, while it takes $i - 1$
successive quadratic transformations to transform a point $O^{(i)}$ of kind $i$ into a
proper point (a point of kind 1), it takes as well $i - 1$ successive quadratic
transformations to transform a simple v-ideal $\mathscr{P}_i$ of kind $i$ into a simple v-ideal
of kind 1, i.e. into a prime 0-dimensional ideal.
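
The count of quadratic transformations can be followed on a small example. The sketch below takes the ideal $(y, x^3)$, assumed here to be a simple v-ideal (it consists of the polynomials of value $\geq 3$ when $v(x) = 1$, $v(y) = 3$), and writes $x'' = x'$, $y'' = y'/x'$ for the second transformation; these names and the choice of ideal are illustrative and not taken from the text.

```latex
\begin{align*}
\mathfrak{O}'\cdot(y,\,x^3) &= (x'y',\,x'^3) = x'\cdot(y',\,x'^2)
  &&\Longrightarrow\quad T\bigl((y,\,x^3)\bigr) = (y',\,x'^2),\\
\mathfrak{O}''\cdot(y',\,x'^2) &= (x''y'',\,x''^2) = x''\cdot(y'',\,x'')
  &&\Longrightarrow\quad T\bigl((y',\,x'^2)\bigr) = (x'',\,y'').
\end{align*}
% Two transformations lead to the prime ideal (x'', y''), so (y, x^3) is of kind 3.
```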


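THEOREM 6.1. If $\{\mathscr{P}_\alpha\}$ and $\{\bar{\mathscr{P}}_\alpha\}$ are the sequences of simple v-ideals
in $\mathfrak{O}$ belonging to valuations $B$ and $\bar{B}$ respectively and if, for a given $i$, we
have $\mathscr{P}_i = \bar{\mathscr{P}}_i$, then also $\mathscr{P}_\alpha = \bar{\mathscr{P}}_\alpha$ for any $\alpha < i$. In other words, the $i - 1$
simple v-ideals which precede a given simple v-ideal $\mathscr{P}_i$ of kind $i$ in a given
valuation $B$ for which $\mathscr{P}_i$ is a v-ideal, are uniquely determined by $\mathscr{P}_i$, being
independent of the valuation $B$.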


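Proof by induction. The theorem is trivial for $i = 1$. Assume that the
theorem is true for simple v-ideals of kind $i - 1$, and apply the quadratic
transformation $T$. We will have then in $\mathfrak{O}'$: $\mathscr{P}'_{i-1} = \bar{\mathscr{P}}'_{i-1}$, where $\mathscr{P}'_{\alpha-1} = T(\mathscr{P}_\alpha)$
and $\bar{\mathscr{P}}'_{\alpha-1} = T(\bar{\mathscr{P}}_\alpha)$. Hence, by our induction, $\mathscr{P}'_\alpha = \bar{\mathscr{P}}'_\alpha$ for $\alpha = 1, 2, \dots, i-2$,
whence $\mathscr{P}_\alpha = \bar{\mathscr{P}}_\alpha$ for $\alpha = 2, \dots, i-1$, because $\mathscr{P}_\alpha$, as well as $\bar{\mathscr{P}}_\alpha$, is the
contracted ideal of the ideal $x'^h\mathscr{P}'_{\alpha-1}$ $(= x'^h\bar{\mathscr{P}}'_{\alpha-1})$, where $h$ is the smallest integer
such that $x'^h\mathscr{P}'_{\alpha-1}$ is an extended ideal of an ideal in $\mathfrak{O}$. Moreover, $\mathscr{P}_1 = \bar{\mathscr{P}}_1$,
since $\mathscr{P}_1$ and $\bar{\mathscr{P}}_1$ are the prime ideals belonging to $\mathscr{P}_i$ and $\bar{\mathscr{P}}_i$ respectively,
q.e.d.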

A much stronger theorem can be proved: 




THEOREM 6.2. Under the hypothesis $\mathscr{P}_i = \bar{\mathscr{P}}_i$ of Theorem 6.1, the set
of v-ideals for $B$ which precede $\mathscr{P}_i$ (divisors of $\mathscr{P}_i$) coincides with the set of
v-ideals for $\bar{B}$ which precede $\bar{\mathscr{P}}_i$.


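Proof by induction with respect to $i$. For $i = 1$ the theorem is trivial.
Assume that the theorem is true for simple v-ideals of kind $i - 1$. Let $\mathfrak{q}_k$ be
a v-ideal for $B$ such that $\mathscr{P}_i \equiv 0(\mathfrak{q}_k)$. If $\mathfrak{q}_k$ is simple, then it is also a v-ideal
for $\bar{B}$, by the preceding theorem. If $\mathfrak{q}_k$ is composite, we know by Theorem 5.1
that it can be factored into simple v-ideals belonging to $B$. Since $\mathfrak{q}_k$ is a
proper divisor of $\mathscr{P}_i$, only factors $\mathscr{P}_j$, $j < i$, can occur. Let

$$\mathfrak{q}_k = \prod_{j=1}^{i-1} \mathscr{P}_j^{\alpha_j}. \tag{15}$$

We consider separately two cases: (1) $\alpha_1 = 0$; (2) $\alpha_1 > 0$.

(1) First case: $\alpha_1 = 0$. We apply our quadratic transformation $T$. We
find then

$$\mathfrak{q}'_\sigma = T(\mathfrak{q}_k) = \prod_{j=1}^{i-2} \mathscr{P}_j'^{\,\alpha_{j+1}}.$$

Since the factor $\mathscr{P}_1$ $(= \mathfrak{p})$ does not occur in the factorization of $\mathfrak{q}_k$, the system
$\Omega(\mathfrak{q}_k)$ of subforms of $\mathfrak{q}_k$ is of dimension zero. Hence $\mathfrak{q}_k = T^{-1}(\mathfrak{q}'_\sigma)$. Since
also $\mathscr{P}_i = T^{-1}(\mathscr{P}'_{i-1})$ and since $\mathscr{P}_i \equiv 0(\mathfrak{q}_k)$, we have, by Theorem 4.4,
$\mathscr{P}'_{i-1} \equiv 0(\mathfrak{q}'_\sigma)$. Now $\mathscr{P}'_{i-1}$ is a v-ideal in $\mathfrak{O}'$ for both $B$ and $\bar{B}$. Hence, by
our induction, $\mathfrak{q}'_\sigma$ is also a v-ideal for $\bar{B}$. But then also $\mathfrak{q}_k$ must be a v-ideal
in $\mathfrak{O}$ for $\bar{B}$, since $\mathfrak{q}_k = T^{-1}(\mathfrak{q}'_\sigma)$.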


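(2) Second case: $\alpha_1 > 0$. We now use an induction with respect to $k$,
i.e. we assume it has been already proved that all the v-ideals $\mathfrak{q}_j$ for $B$, $j < k$,
are also v-ideals for $\bar{B}$. Since in the factorization (15) the factor $\mathfrak{p}$ occurs to
the power $\alpha_1$, it follows that the system $\Omega(\mathfrak{q}_k)$ of subforms of $\mathfrak{q}_k$ is of dimension
$\alpha_1$. Hence, by Theorem 3 (Corollary 3.2) we can write $\mathfrak{q}_k = \mathfrak{p}^{\alpha_1}\mathfrak{q}_\tau$, where $\mathfrak{q}_\tau$
is a v-ideal for $B$. Let $\mathfrak{p}^{\alpha_1 - 1}\mathfrak{q}_\tau \sim \mathfrak{q}_s$. We have $\mathfrak{q}_k \equiv 0(\mathfrak{p}\mathfrak{q}_s)$; on the other hand,
since $v(\mathfrak{p}^{\alpha_1 - 1}\mathfrak{q}_\tau) = v(\mathfrak{q}_s)$, it follows that $v(\mathfrak{p}\mathfrak{q}_s) = v(\mathfrak{p}^{\alpha_1}\mathfrak{q}_\tau) = v(\mathfrak{q}_k)$, whence
$\mathfrak{p}\mathfrak{q}_s \equiv 0(\mathfrak{q}_k)$. Consequently $\mathfrak{q}_k = \mathfrak{p}\mathfrak{q}_s$. We now consider the $k$-th v-ideal $\bar{\mathfrak{q}}_k$
for $\bar{B}$. We must suppose that the factor $\bar{\mathscr{P}}_1$ occurs also in the factorization of
$\bar{\mathfrak{q}}_k$, since otherwise $\bar{\mathfrak{q}}_k$ would also be a v-ideal for $B$, by the preceding case
$\alpha_1 = 0$, whence necessarily $\bar{\mathfrak{q}}_k = \mathfrak{q}_k$, since $k$ is the length of both ideals $\mathfrak{q}_k$, $\bar{\mathfrak{q}}_k$.
Let then $\bar{\mathfrak{q}}_k = \mathfrak{p}\bar{\mathfrak{q}}_\sigma$. By our induction, $\bar{\mathfrak{q}}_\sigma$ is a v-ideal for $B$, since $\sigma < k$; i.e.
$\bar{\mathfrak{q}}_\sigma = \mathfrak{q}_\sigma$. Hence $\bar{\mathfrak{q}}_k = \mathfrak{p}\mathfrak{q}_\sigma$, $\mathfrak{q}_k = \mathfrak{p}\mathfrak{q}_s$. Since of the two ideals $\mathfrak{q}_s$, $\mathfrak{q}_\sigma$ one is a
divisor of the other, the same is true of the ideals $\mathfrak{q}_k$ and $\bar{\mathfrak{q}}_k$. But these two
ideals have the same length $k$, consequently $\bar{\mathfrak{q}}_k = \mathfrak{q}_k$, q.e.d.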




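Remark. The following example shows that a composite v-ideal does not
determine the v-ideals preceding it. Let $B$ be the valuation defined by the
branch $y = x^{3/2}$ and let $\bar{B}$ be the valuation determined by the branch $y = x^{2/3}$.
We have then $\mathfrak{q}_1 = \mathfrak{p} = (x, y)$, $\mathfrak{q}_2 = (y, x^2)$, $\mathfrak{q}_3 = \mathfrak{p}^2$, while $\bar{\mathfrak{q}}_1 = \mathfrak{p} = \mathfrak{q}_1$,
$\bar{\mathfrak{q}}_2 = (x, y^2) \neq \mathfrak{q}_2$, $\bar{\mathfrak{q}}_3 = \mathfrak{p}^2 = \mathfrak{q}_3$.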
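
The first three v-ideals for the branch $y = x^{3/2}$ can be verified directly. The sketch below uses the parametrization $x = t^2$, $y = t^3$ (an assumption made here for the computation) and relies on the fact that the monomials of value less than 6 have distinct values, so that no cancellation of leading terms can occur in that range.

```latex
% Values in B: v(x) = 2, v(y) = 3, hence v(x^i y^j) = 2i + 3j.
\begin{align*}
\mathfrak{q}_1 &= \{f : v(f) \ge 2\} = (x,\,y) = \mathfrak{p},\\
\mathfrak{q}_2 &= \{f : v(f) \ge 3\} = (y,\,x^2),\\
\mathfrak{q}_3 &= \{f : v(f) \ge 4\} = (x^2,\,xy,\,y^2) = \mathfrak{p}^2 .
\end{align*}
% Lengths 1, 2, 3: the residue classes are spanned by 1; by 1, x; by 1, x, y.
```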

7. A factorization theorem for v-ideals. We know from the preceding
sections that any v-ideal $\mathfrak{A}$, belonging to a given valuation $B$, can be factored
into simple v-ideals belonging to the same valuation $B$. The question arises
as to the unicity of this factorization. The unicity of the factorization may be
a priori intended in more than one way. In the first place, we may fix some
valuation $B$ to which $\mathfrak{A}$ belongs, and we may ask whether the factorization
of $\mathfrak{A}$ into simple v-ideals belonging to $B$ is unique. We may go a step further
and ask whether the factorization of $\mathfrak{A}$, if unique for a given valuation $B$, is
independent of $B$. Finally, we may formulate the unicity of factorization of
$\mathfrak{A}$ in its strongest possible form and assert that $\mathfrak{A}$ can be factored in a unique
manner into simple v-ideals, where we allow a priori that the simple v-factors
may belong to different valuations. It is this strongest form of the unicity
theorem which we proceed to prove. It will, of course, follow from this
theorem that the simple v-factors are v-ideals for any valuation for which $\mathfrak{A}$
is a v-ideal.

We prove, however, a stronger theorem, from which the unique factorization
of v-ideals into simple v-ideals will follow:


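THEOREM 7.1. Let $\mathfrak{A}_1, \mathfrak{A}_2, \dots, \mathfrak{A}_k$ and $\bar{\mathfrak{A}}_1, \bar{\mathfrak{A}}_2, \dots, \bar{\mathfrak{A}}_s$ be two sets of
simple v-ideals belonging to valuations $B_1, \dots, B_k$ and $\bar{B}_1, \bar{B}_2, \dots, \bar{B}_s$
respectively. If

$$\mathfrak{A}_1^{\alpha_1}\mathfrak{A}_2^{\alpha_2}\cdots\mathfrak{A}_k^{\alpha_k} = \bar{\mathfrak{A}}_1^{\bar\alpha_1}\bar{\mathfrak{A}}_2^{\bar\alpha_2}\cdots\bar{\mathfrak{A}}_s^{\bar\alpha_s}, \tag{16}$$

where the $\alpha_i$'s and $\bar\alpha_j$'s are positive integers, then necessarily $k = s$ and, for a
proper arrangement of the indices, $\mathfrak{A}_i = \bar{\mathfrak{A}}_i$, $\alpha_i = \bar\alpha_i$.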

This theorem is stronger, because a product of v-ideals, in particular the
power product $\prod \mathfrak{A}_i^{\alpha_i}$, is not necessarily a v-ideal.⁷

Proof.⁸ Let the simple v-ideals $\mathfrak{A}_i$ and $\bar{\mathfrak{A}}_j$ be of kind $h_i$ and $\bar{h}_j$
respectively and let $m = \max(h_1, \dots, h_k, \bar{h}_1, \dots, \bar{h}_s)$. The theorem is trivial


⁷ For instance, let $\mathfrak{A}_1 = (y, x^2)$, $\mathfrak{A}_2 = (x, y^2)$. Both $\mathfrak{A}_1$, $\mathfrak{A}_2$ are v-ideals, but
$\mathfrak{A}_1\mathfrak{A}_2 = (xy, \mathfrak{p}^3)$ is not a v-ideal, because the system $\Omega(\mathfrak{A}_1\mathfrak{A}_2)$ of subforms is of
dimension zero and its base $xy$ is not the power of a linear form (see Theorem 3).

⁸ We assume that all the ideals $\mathfrak{A}_i$, $\bar{\mathfrak{A}}_j$ are zero-dimensional (necessarily primary).
The case of one-dimensional simple v-ideals is trivial, because any such ideal is prime,




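in the case $m = 1$. In fact, in this case the ideals $\mathfrak{A}_i$ and $\bar{\mathfrak{A}}_j$ are prime
zero-dimensional ideals, and the power products on both sides of (16) coincide with
the intersections of the primary ideals $\mathfrak{A}_i^{\alpha_i}$ and $\bar{\mathfrak{A}}_j^{\bar\alpha_j}$ respectively. The theorem
follows in this case from the unicity of the decomposition of a zero-dimensional
ideal into primary components. We may then prove our theorem by induction
with respect to $m$.

The partial power products on both sides of (16) consisting of factors
which belong to one and the same zero-dimensional prime ideal must be equal to each
other. Hence it is sufficient to prove the theorem for the case in which the
ideals $\mathfrak{A}_i$, $\bar{\mathfrak{A}}_j$ all belong to one and the same prime ideal, say to $\mathfrak{p} = (x, y)$.
The ideal $\mathfrak{p}$ consists then, for each of the given valuations $B_i$, $\bar{B}_j$, of all the
polynomials whose value is $> 0$. We may assume that $v(x) = v(\mathfrak{p})$ in any
of the valuations $B_i$, $\bar{B}_j$. Let us now apply the quadratic transformation
$x' = x$, $y' = y/x$, and let $T(\mathfrak{A}_i) = \mathfrak{A}'_i$, $T(\bar{\mathfrak{A}}_j) = \bar{\mathfrak{A}}'_j$. If we have, in the
valuation $B_i$, $v\!\left(\dfrac{y - c_i x}{x}\right) > 0$, then $\mathfrak{A}'_i$ will be a primary ideal in $\mathfrak{O}' = \mathfrak{k}[x', y']$
belonging to the ideal $(x', y' - c_i)$. A similar remark holds for $\bar{\mathfrak{A}}'_j$. At any
rate, $\mathfrak{A}'_i$ will be a simple v-ideal of kind $h_i - 1$ and $\bar{\mathfrak{A}}'_j$ will be a simple v-ideal
of kind $\bar{h}_j - 1$. We must remember, however, that the transform of a simple
v-ideal of kind 1, i.e. of $\mathfrak{p}$, is the unit ideal $\mathfrak{O}'$. If then $\mathfrak{A}_1 = \bar{\mathfrak{A}}_1 = \mathfrak{p}$, where
we allow now that one or both of the exponents $\alpha_1$, $\bar\alpha_1$ may be zero, operating
by $T$ on (16) we get

$$\mathfrak{A}_2'^{\,\alpha_2}\cdots\mathfrak{A}_k'^{\,\alpha_k} = \bar{\mathfrak{A}}_2'^{\,\bar\alpha_2}\cdots\bar{\mathfrak{A}}_s'^{\,\bar\alpha_s}.$$

Since $\max(h_i - 1, \bar{h}_j - 1) = m - 1$, we have by our induction, $k = s$,
$\mathfrak{A}_i = \bar{\mathfrak{A}}_i$, $\alpha_i = \bar\alpha_i$ $(i > 1)$, and (16) becomes

$$\mathfrak{p}^{\alpha_1}\mathfrak{A}_2^{\alpha_2}\cdots\mathfrak{A}_k^{\alpha_k} = \mathfrak{p}^{\bar\alpha_1}\mathfrak{A}_2^{\alpha_2}\cdots\mathfrak{A}_k^{\alpha_k}.$$

Now $\alpha_1$ is the dimension of the system $\Omega(\mathfrak{p}^{\alpha_1}\mathfrak{A}_2^{\alpha_2}\cdots\mathfrak{A}_k^{\alpha_k})$ and similarly $\bar\alpha_1$ is
the dimension of $\Omega(\mathfrak{p}^{\bar\alpha_1}\mathfrak{A}_2^{\alpha_2}\cdots\mathfrak{A}_k^{\alpha_k})$. Hence $\alpha_1 = \bar\alpha_1$ and the theorem is
proved.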

PART II. 


8. Algebraic and transcendental valuations. In this part of the paper 
we shall use the apparatus of formal power series in order to derive further 


⁸ (continued) and is therefore a principal ideal $(f)$, where $f$ is an irreducible polynomial. The
one-dimensional factors and their exponents on both sides of (16) must be the same and
may be deleted.




properties of valuation ideals in the ring of polynomials. The use of formal
power series is clearly indicated by the fact that a zero-dimensional valuation
of the field $\Sigma$ of rational functions of $x$ and $y$ is essentially a local property
of the field. It is known that any zero-dimensional valuation $B$ of rank 1,
in which $x$ and $y$ have positive values, can be obtained by the following
construction: We put

$$x = A(t) = \alpha_1 t^{a_1} + \alpha_2 t^{a_2} + \cdots, \qquad y = B(t) = \beta_1 t^{b_1} + \beta_2 t^{b_2} + \cdots,$$

where the coefficients $\alpha$ and $\beta$ belong to the underlying field $\mathfrak{k}$ and where the
exponents $a_i$, $b_j$ of each power series $A(t)$ and $B(t)$ form a monotonic
increasing sequence of positive real numbers. By substitution of these power
series every element $r$ of $\Sigma$ takes a definite form

$$r = \gamma_1 t^{c_1} + \gamma_2 t^{c_2} + \cdots \qquad (\gamma_1 \neq 0),$$

and the valuation $B$ is obtained by putting $v(r) = c_1$. We may eliminate
formally $t$ between $x = A(t)$ and $y = B(t)$ and we may thus define the valuation
$B$ by putting $y = P(x) = \delta_1 x^{d_1} + \delta_2 x^{d_2} + \cdots$, where the exponents are
again increasing positive real numbers.
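
A small worked instance may help to fix the notation. The series below are a hypothetical choice, not taken from the text; the computation shows how a value is read off after substitution and how $t$ is eliminated to give $y = P(x)$.

```latex
\begin{align*}
x &= t^2, \qquad y = t^3 + t^4 \qquad\Longrightarrow\qquad y = P(x) = x^{3/2} + x^{2},\\
r &= y^2 - x^3 = (t^3 + t^4)^2 - t^6 = 2t^7 + t^8,\\
v(x) &= 2,\qquad v(y) = 3,\qquad v(y^2 - x^3) = 7 \quad(\text{characteristic} \neq 2).
\end{align*}
```

Note that $v(y^2 - x^3) = 7$ exceeds the common value 6 of $y^2$ and $x^3$: the leading terms cancel, which is why the value of a polynomial is not determined by the values of its monomials alone.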

Now we may effect the substitution $x = A(t)$, $y = B(t)$ not only in any
rational function $r$ of $x$, $y$ but also in any formal power series $\xi = \sum a_{ij}x^iy^j$,
$i, j = 0, 1, \dots$ ($i$, $j$ integers), and in any quotient of such formal power series. In
this manner the valuation $B$ defines a valuation $B^*$ of the field $\Sigma^*$ of
meromorphic functions of $x$, $y$, and the valuation ring of $B^*$ contains the ring $\mathfrak{O}^*$
of holomorphic functions $\xi$.

The special case in which the exponents $a_i$, $b_j$ of the power series $A(t)$,
$B(t)$ are integers, is the only one which is of interest in the classical theory
of algebraic and analytic functions. The corresponding valuations $B$ may be
called algebroid or analytic, while non-analytic valuations may be referred to
as transcendental valuations. For an algebroid valuation the power series $P(x)$
is an ordinary Puiseux series, i.e. the exponents $d_1, d_2, \dots$ are rational
numbers with fixed denominator:

$$d_i = m_i/n, \qquad (n, m_1, m_2, \dots) = 1.$$

If the branch $y = P(x)$ is algebraic, i.e. belongs to an algebraic curve
$f(x, y) = 0$, $f$ irreducible, then the valuation $B$ is algebraic and is effectively
of rank 2, being composed of the prime divisor defined by the prime ideal $(f)$
and of a valuation of the field of rational functions on the curve $f(x, y) = 0$.
In all cases, if $B$ is algebroid, the induced valuation $B^*$ of the field of
meromorphic functions is of rank 2. If, namely, we denote by $P_1 = P, P_2, \dots, P_n$




the $n$ determinations of the power series $P(x)$, corresponding to the $n$
determinations of $x^{1/n}$, then $\zeta = \prod (y - P_i)$ is a holomorphic function of $x$, $y$,
i.e. an element of $\mathfrak{O}^*$, and is indecomposable in $\mathfrak{O}^*$. Given any element $\xi$ of $\mathfrak{O}^*$,
we have a unique decomposition $\xi = \zeta^\rho\xi_1$, $\xi_1 \not\equiv 0(\zeta)$. The substitution $y = P(x)$
does not annihilate $\xi_1$, and hence we find for $\xi_1$ a definite representation
$\xi_1 = \gamma_1 x^{c_1} + \gamma_2 x^{c_2} + \cdots$. We define $B^*$ by putting $v(\xi) = (\rho, c_1)$.
It is evident that, conversely, any valuation $B$ of $\Sigma$ of rank 2 is algebraic,
and if the values of $x$ and $y$ are positive, $B$ can be defined by putting $y$ equal
to a Puiseux series in $x$, provided that the divisor of which $B$ is composed is
of the first kind with respect to $\mathfrak{O}$.
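
As an illustration of the rank 2 value $v(\xi) = (\rho, c_1)$, the following sketch takes the branch $P(x) = x^{3/2}$ (so $n = 2$) and a sample element $\xi$; both choices are assumptions made for the example only.

```latex
\begin{align*}
P_1 &= x^{3/2},\quad P_2 = -x^{3/2},\qquad
\zeta = (y - x^{3/2})(y + x^{3/2}) = y^2 - x^3,\\
\xi &= (y^2 - x^3)(x + y) = \zeta^{1}\,\xi_1,\qquad \xi_1 = x + y \not\equiv 0\,(\zeta),\\
\xi_1\big|_{y = P(x)} &= x + x^{3/2}
\quad\Longrightarrow\quad \rho = 1,\; c_1 = 1,\; v(\xi) = (1,\,1).
\end{align*}
```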
In the sequel we will have no occasion to use transcendental valuations. 
The results of Part I enable us, in fact, to prove the following theorem: 


THEOREM 8.1. Every valuation ideal in $\mathfrak{O}$ belongs to an algebroid (and
even to an algebraic) valuation of $\Sigma$.


Proof.⁹ It is sufficient to prove this assertion for 0-dimensional (primary)
v-ideals, because v-ideals possessing a 1-dimensional component can belong only
to algebraic valuations. Let $\{\mathfrak{q}_i\}$ be the sequence of zero-dimensional v-ideals in
$\mathfrak{O}$ belonging to a valuation $B$. We wish then to prove that given any ideal in the
sequence, say $\mathfrak{q}_n$, there exists an algebraic valuation for which $\mathfrak{q}_n$ is a valuation
ideal. The nature of our proof requires that a stronger assertion be established.
We propose to prove that there exists an algebraic valuation $\bar{B}$ such that in the
Jordan sequence $\{\bar{\mathfrak{q}}_i\}$ of the 0-dimensional v-ideals belonging to $\bar{B}$, the first $n$
ideals $\bar{\mathfrak{q}}_1, \bar{\mathfrak{q}}_2, \dots, \bar{\mathfrak{q}}_n$ coincide with $\mathfrak{q}_1, \mathfrak{q}_2, \dots, \mathfrak{q}_n$. This we prove by induction
with respect to $n$, assuming then that this assertion has been already established
for $n - 1$, for any choice of the generators $x$, $y$ of $\Sigma$ and for any valuation of $\Sigma$
whose valuation ring contains $\mathfrak{k}[x, y]$. Assuming, as usual, that $x$ and $y$ have
positive values in $B$, we use the quadratic transformation $T$: $x' = x$, $y' = y/x$,
getting the ring $\mathfrak{O}' = \mathfrak{k}[x', y']$ and the sequence of v-ideals $\{\mathfrak{q}'_j\}$ in $\mathfrak{O}'$
belonging to $B$. By our induction, there exists an algebraic valuation $\bar{B}$ of $\Sigma$
whose valuation ring contains $\mathfrak{O}'$ and such that the ideals $\mathfrak{q}'_1, \mathfrak{q}'_2, \dots, \mathfrak{q}'_{n-1}$
are v-ideals belonging to $\bar{B}$. Let $\{\bar{\mathfrak{q}}_i\}$ be the sequence of v-ideals in $\mathfrak{O}$
belonging to $\bar{B}$. We now use the results of section 4. The $n - 1$ ideals
$T^{-1}\mathfrak{q}'_j$, $j = 1, 2, \dots, n-1$, must be members of both sequences $\{\mathfrak{q}_i\}$ and $\{\bar{\mathfrak{q}}_i\}$;
here $T^{-1}\mathfrak{q}'_j$ is the contracted ideal of $x'^{h_j}\mathfrak{q}'_j$, where $h_j$ is defined as the smallest
integer such that $x'^{h_j}\mathfrak{q}'_j$ is an extended ideal of an ideal in $\mathfrak{O}$. Let
$T^{-1}\mathfrak{q}'_j = \mathfrak{q}_{\alpha_j} = \bar{\mathfrak{q}}_{\alpha_j}$, $j = 1, 2, \dots, n-1$ (the indices $\alpha_j$ are the same, since


⁹ Note that the proof makes no use of Theorem 5.2.




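the index $j$ of $\mathfrak{q}_j$ is its length). We also know that $\alpha_1 < \alpha_2 < \cdots < \alpha_{n-1}$, and
that $\alpha_{n-1} \geq n$. Moreover, if $i \leq \alpha_{n-1}$, then $\mathfrak{q}_i$ is necessarily of the form $\mathfrak{p}^\rho\mathfrak{q}_{\alpha_\sigma}$,
$\sigma \leq n - 1$, and also $\bar{\mathfrak{q}}_i$ is of the form $\mathfrak{p}^{\bar\rho}\bar{\mathfrak{q}}_{\alpha_{\bar\sigma}}$, $\bar\sigma \leq n - 1$. We assert that
$\mathfrak{q}_i = \bar{\mathfrak{q}}_i$, $i = 1, 2, \dots, \alpha_{n-1}$. Suppose that we know already that $\mathfrak{q}_j = \bar{\mathfrak{q}}_j$ for
all $j < i \leq \alpha_{n-1}$. If $\mathfrak{q}_i$ (or $\bar{\mathfrak{q}}_i$) coincides with one of the ideals $\mathfrak{q}_{\alpha_j}$, $j \leq n - 1$,
there is nothing to prove: we will have $\mathfrak{q}_i = \mathfrak{q}_{\alpha_j} = \bar{\mathfrak{q}}_{\alpha_j} = \bar{\mathfrak{q}}_i$. In the contrary
case, we have $\mathfrak{q}_i = \mathfrak{p}^\rho\mathfrak{q}_{\alpha_\sigma}$, $\rho > 0$. It is not difficult to see that $\mathfrak{q}_i$ is then also
of the form $\mathfrak{q}_i = \mathfrak{p}\mathfrak{q}_j$, where necessarily $j < i$. In fact, let $\mathfrak{p}^{\rho-1}\mathfrak{q}_{\alpha_\sigma} \sim \mathfrak{q}_j$ (the
equivalence being intended in the sense of the valuation $B$ and $\mathfrak{q}_j$ being a
v-ideal for $B$). Then $\mathfrak{p}^{\rho-1}\mathfrak{q}_{\alpha_\sigma} \equiv 0(\mathfrak{q}_j)$, whence $\mathfrak{q}_i \equiv 0(\mathfrak{p}\mathfrak{q}_j)$. On the other
hand it is clear that $\mathfrak{q}_i \sim \mathfrak{p}\mathfrak{q}_j$, whence $\mathfrak{p}\mathfrak{q}_j \equiv 0(\mathfrak{q}_i)$. Hence $\mathfrak{q}_i = \mathfrak{p}\mathfrak{q}_j$. In a
similar manner we find for $\bar{\mathfrak{q}}_i$ a representation of the form $\bar{\mathfrak{q}}_i = \mathfrak{p}\bar{\mathfrak{q}}_\mu$. Since
$\mu < i$, we have $\bar{\mathfrak{q}}_\mu = \mathfrak{q}_\mu$, whence of the two ideals $\mathfrak{q}_j$ and $\mathfrak{q}_\mu$ one is a divisor of
the other ($\mathfrak{q}_j \equiv 0(\mathfrak{q}_\mu)$ or $\mathfrak{q}_\mu \equiv 0(\mathfrak{q}_j)$ according as $j \geq \mu$ or $\mu \geq j$). As a
consequence it is also true that of the two ideals $\mathfrak{q}_i$ and $\bar{\mathfrak{q}}_i$, one is a divisor
of the other. Now both $\mathfrak{q}_i$ and $\bar{\mathfrak{q}}_i$ have the same length $i$. Consequently
$\mathfrak{q}_i = \bar{\mathfrak{q}}_i$, and this proves our assertion. Since the equality $\mathfrak{q}_i = \bar{\mathfrak{q}}_i$ holds for
$i = 1, 2, \dots, \alpha_{n-1}$, and since $\alpha_{n-1} \geq n$, it follows that $\mathfrak{q}_1, \mathfrak{q}_2, \dots, \mathfrak{q}_n$ belong as
v-ideals to the algebraic valuation $\bar{B}$, and this proves our theorem.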

Using Theorem 8.1 and the results of sections 5, 6, it is possible to give
a very simple proof of the well-known fact that every valuation $B$ of $\Sigma$ is the
limit of algebraic valuations. Let $\{\mathfrak{q}_i\}$ be the sequence of 0-dimensional v-ideals
belonging to $B$. We may assume that the intersection of the ideals $\mathfrak{q}_i$ is the
0-ideal, since otherwise $B$ itself is algebraic. In the proof of Theorem 8.1
it has been shown that for any value of $k$ there exists an algebraic valuation $B_k$
of $\Sigma$, for which the ideals $\mathfrak{q}_1, \mathfrak{q}_2, \dots, \mathfrak{q}_k$ are valuation ideals. Let $r/s$ be any
element of the valuation ring $\mathfrak{B}$ of $B$, $r, s \in \mathfrak{O}$. There will then exist an integer
$n$ such that $r \equiv 0(\mathfrak{q}_n)$, $s \equiv 0(\mathfrak{q}_n)$, $s \not\equiv 0(\mathfrak{q}_{n+1})$. This integer will depend only on $s$. For
all values of $k$ such that $k \geq n + 1$, the ideals $\mathfrak{q}_n$ and $\mathfrak{q}_{n+1}$ will also be v-ideals
for $B_k$, and hence for all such values of $k$, $r/s$ will also belong to the valuation
ring $\mathfrak{B}_k$ of $B_k$. As a consequence, we have $\operatorname{Lim}\,(\mathfrak{B}_k \cap \mathfrak{B}) = \mathfrak{B}$, i.e. $B$ can be
regarded as the limit of the valuations $B_k$.

Using the characteristic property of the Jordan sequence $\{\mathfrak{q}_i\}$ of v-ideals
established in section 2, and the properties of simple ideals given in sections 5
and 6, we may go a step further and gain an insight into the manner in which
transcendental valuations are constructed. Given a simple v-ideal $\mathscr{P}_k$, we
know that it determines uniquely the sequence of simple v-ideals $\mathscr{P}_1, \mathscr{P}_2, \dots, \mathscr{P}_{k-1}$
which precede $\mathscr{P}_k$ in any valuation to which $\mathscr{P}_k$ belongs. We ask now the
following question: given $\mathscr{P}_k$ (and hence given the entire sequence $\mathscr{P}_1, \mathscr{P}_2, \dots, \mathscr{P}_k$)




in how many ways is it possible to choose $\mathscr{P}_{k+1}$? To answer this question, we
apply $k - 1$ successive quadratic transformations, getting a ring $\bar{\mathfrak{O}}$ of
polynomials of variables $X$, $Y$, in which to $\mathscr{P}_k$ there corresponds a simple v-ideal $\bar{\mathscr{P}}_1$
of kind one, i.e. $\bar{\mathscr{P}}_1$ is prime and 0-dimensional, say $\bar{\mathscr{P}}_1 = (X, Y)$. Any
maximal subideal of $\bar{\mathscr{P}}_1$ is of the form $(aX + bY, \bar{\mathscr{P}}_1^2)$, where $a$, $b$ are in $\mathfrak{k}$
and are not both zero. It is obvious that any such maximal subideal of $\bar{\mathscr{P}}_1$
belongs to some valuation (for instance, to any valuation defined by putting
$X = bt + \cdots$, $Y = -at + \cdots$) and is moreover a simple v-ideal. These
maximal subideals are in (1,1) correspondence with the ratio $z = a/b$, i.e.
with the places of the purely transcendental field $\mathfrak{k}(z)$. Going back to our
original ring $\mathfrak{O}$, we see that the set of simple v-ideals $\mathscr{P}_{k+1}$ of kind $k + 1$ such
that $\mathscr{P}_1, \dots, \mathscr{P}_k, \mathscr{P}_{k+1}$ belong to one and the same valuation, is in (1,1)
correspondence with the set consisting of the elements of the underlying field
$\mathfrak{k}$ and of the symbol $\infty$. Starting with $\mathscr{P}_1$ we can then construct, in infinitely
many ways, an infinite sequence $\mathscr{P}_1, \mathscr{P}_2, \dots, \mathscr{P}_k, \dots$ of simple v-ideals, such
that, for any $k$, the ideals $\mathscr{P}_1, \mathscr{P}_2, \dots, \mathscr{P}_k$ belong to some valuation $B_k$, which
we may suppose to be algebraic. We assert that the infinite sequence $\{\mathscr{P}_k\}$
defines a valuation $B$ of $\Sigma$. Obviously then $B = \operatorname{Lim} B_k$. To see this, we
first observe that, by Theorem 6.2, the infinite sequence $\{\mathscr{P}_k\}$ determines
uniquely an infinite Jordan sequence $\{\mathfrak{q}_j\}$ which contains the sequence $\{\mathscr{P}_k\}$
and which has the property that the elements of the sequence which precede a
given $\mathscr{P}_i$ are v-ideals for all the valuations $B_k$, $k \geq i$. It follows immediately
that the congruence $\mathfrak{q}_{m+1}\mathfrak{q}_n : \mathfrak{q}_m \equiv 0(\mathfrak{q}_{n+1})$ holds true for any two ideals $\mathfrak{q}_m$, $\mathfrak{q}_n$
of the sequence $\{\mathfrak{q}_j\}$, since the ideals which occur in this congruence are v-ideals
belonging to $B_k$, when $k$ is sufficiently large. As a consequence the sequence
$\{\mathfrak{q}_j\}$ effectively defines a valuation of $\Sigma$, q.e.d.
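
The congruence used in this argument, read here as $\mathfrak{q}_{m+1}\mathfrak{q}_n : \mathfrak{q}_m \equiv 0(\mathfrak{q}_{n+1})$, is easy to verify when the $\mathfrak{q}_j$ are consecutive v-ideals of one valuation. The sketch below assumes that $v(\mathfrak{q})$ denotes the least value of an element of $\mathfrak{q}$ and that $\mathfrak{q}_{n+1}$ consists of all elements of value greater than $v(\mathfrak{q}_n)$.

```latex
% Let f satisfy f q_m \subseteq q_{m+1} q_n, and choose g in q_m with v(g) = v(q_m).
\begin{align*}
v(f) + v(\mathfrak{q}_m) = v(fg) &\ge v(\mathfrak{q}_{m+1}) + v(\mathfrak{q}_n) > v(\mathfrak{q}_m) + v(\mathfrak{q}_n),\\
v(f) &> v(\mathfrak{q}_n) \quad\Longrightarrow\quad f \in \mathfrak{q}_{n+1}.
\end{align*}
```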


9. Valuation ideals in the ring of holomorphic functions.¹⁰ It has
been pointed out in the preceding section that any 0-dimensional valuation $B$
of the field $\Sigma = \mathfrak{k}(x, y)$ in which $x$ and $y$ have positive values, defines a
valuation $B^*$ of the field $\Sigma^*$ of meromorphic functions of $x$, $y$, whose valuation ring
contains the ring $\mathfrak{O}^* = \mathfrak{k}\{x, y\}$ of holomorphic functions of $x$, $y$. We have
then also valuation ideals in $\mathfrak{O}^*$ belonging to $B^*$. It is clear that the prime
ideal defined by $B^*$ in $\mathfrak{O}^*$ is the 0-dimensional ideal $\mathfrak{p}^* = (x, y)$, i.e. the
extended ideal of the ideal $\mathfrak{p} = (x, y)$ in $\mathfrak{O}$. Let $\{\mathfrak{q}_i\}$ be the sequence of
0-dimensional v-ideals in $\mathfrak{O}$ belonging to $B$, and let $\mathfrak{q}^*_i = \mathfrak{O}^*\mathfrak{q}_i$ be the extended

¹⁰ Results of this and of the following sections will be later applied toward the
proof of Theorem 5.2. The properties of simple v-ideals derived in Part I and based
on Theorem 5.2 are therefore not to be used until a proof of this theorem has been given
(Corollary 11.2). We may, however, use Theorem 8.1 (see footnote 9).




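ideal of $\mathfrak{q}_i$ in $\mathfrak{O}^*$. It is clear that $v(\mathfrak{q}^*_{i+1}) > v(\mathfrak{q}^*_i)$ in $B^*$, since $v(\mathfrak{q}^*_i) = v(\mathfrak{q}_i)$.
Let $f_1, f_2, \dots, f_k$ be a base of $\mathfrak{q}_i$ and let $f$ be an element of $\mathfrak{q}_i$ not in $\mathfrak{q}_{i+1}$. Since
$\mathfrak{q}_i/\mathfrak{q}_{i+1} \cong \mathfrak{k}$, we have $f_j \equiv c_j f\ (\mathfrak{q}_{i+1})$, $c_j \in \mathfrak{k}$. If then $\xi = \xi_1 f_1 + \cdots + \xi_k f_k$ is
any element of $\mathfrak{q}^*_i$ ($\xi_j \in \mathfrak{O}^*$), we have $\xi \equiv (c_1\xi_1 + \cdots + c_k\xi_k) f\ (\mathfrak{q}^*_{i+1})$. Now
$c_1\xi_1 + \cdots + c_k\xi_k \equiv c\ (\mathfrak{p}^*)$, where $c \in \mathfrak{k}$, and

$$(f)\,\mathfrak{p}^* \equiv 0(\mathfrak{q}^*_i\mathfrak{p}^*) \equiv 0(\mathfrak{O}^*\mathfrak{q}_i\mathfrak{p}) \equiv 0(\mathfrak{q}^*_{i+1}),$$

since $\mathfrak{q}_i\mathfrak{p} \equiv 0(\mathfrak{q}_{i+1})$. Hence $\xi \equiv cf\ (\mathfrak{q}^*_{i+1})$, and this shows that $\mathfrak{q}^*_i/\mathfrak{q}^*_{i+1} \cong \mathfrak{k}$,
i.e. $\mathfrak{q}^*_{i+1}$ is a maximal subideal of $\mathfrak{q}^*_i$. Hence the sequence $\{\mathfrak{q}^*_i\}$ is a Jordan
sequence in $\mathfrak{O}^*$, and since $v(\mathfrak{q}^*_{i+1}) > v(\mathfrak{q}^*_i)$, the sequence $\{\mathfrak{q}^*_i\}$ is the sequence
of v-ideals in $\mathfrak{O}^*$ belonging to $B^*$, i.e. the zero-dimensional v-ideals in $\mathfrak{O}^*$
belonging to $B^*$ are the extended ideals of the zero-dimensional v-ideals in $\mathfrak{O}$
belonging to $B$. It is of course evident that $\mathfrak{q}_i$ is the contracted ideal of $\mathfrak{q}^*_i$.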

The theorems of sections 3, 4,¹¹ the notions of simple and composite ideals
introduced in section 5 and Theorem 5.1 carry over without modification to
ideals in $\mathfrak{O}^*$. In order to see this, a perusal of the proof is not at all necessary.
It is sufficient to take into account quite generally the nature of the
relationship between the polynomial ideals in $\mathfrak{O}$ and the power series ideals in $\mathfrak{O}^*$.
There is a (1,1) correspondence between the primary 0-dimensional ideals in $\mathfrak{O}$
belonging to the prime ideal $\mathfrak{p} = (x, y)$ and the 0-dimensional ideals in $\mathfrak{O}^*$.
The correspondence is such that to an ideal $\mathfrak{q}$ in $\mathfrak{O}$ there corresponds its
extended ideal $\mathfrak{q}^*$ in $\mathfrak{O}^*$ and $\mathfrak{q}$ is the contracted ideal of $\mathfrak{q}^*$. In fact, let $\mathfrak{q}^*$ be
any 0-dimensional ideal in $\mathfrak{O}^*$. If $\rho$ is the exponent of $\mathfrak{q}^*$, then $\mathfrak{q}^*$ possesses
a base consisting of the power products $x^iy^j$, $i + j = \rho$, and of a set of
polynomials $F_\alpha$ of degree $< \rho$. Hence $\mathfrak{q}^* = (F_\alpha, x^iy^j)$ and therefore $\mathfrak{q}^*$ is the
extended ideal of a zero-dimensional primary polynomial ideal $\mathfrak{q}$ belonging
to $\mathfrak{p}$ $(= (x, y))$. Let $\mathfrak{q}^* = \mathfrak{O}^*\cdot(f_1, \dots, f_k)$, where $f_1, f_2, \dots, f_k$ is a base of $\mathfrak{q}$,
and let $\mathfrak{q}'$ be the contracted ideal in $\mathfrak{O}$ of $\mathfrak{q}^*$. If $F$ is any polynomial in $\mathfrak{q}'$,
$F = \sum_{i=1}^{k} \xi_i(x, y) f_i$, $\xi_i \in \mathfrak{O}^*$, and if we denote by $A_i^{(n)}$ the partial sum of the
terms of degree $\leq n$ in the power series $\xi_i(x, y)$, then the polynomial
$F - \sum A_i^{(n)} f_i$ does not contain terms of degree $< n + 1$, whence $F \equiv \sum A_i^{(n)} f_i\ (\mathfrak{p}^{n+1})$.
Now, if $n$ is sufficiently high, then $\mathfrak{p}^{n+1} \equiv 0(\mathfrak{q})$, whence $F \equiv 0(\mathfrak{q})$. Hence
$\mathfrak{q}' \equiv 0(\mathfrak{q})$, and since $\mathfrak{q}'$ is the contracted ideal of $\mathfrak{q}^*$, it follows that $\mathfrak{q}' = \mathfrak{q}$. Thus
every zero-dimensional ideal $\mathfrak{q}^*$ in $\mathfrak{O}^*$ is the extended ideal of one and only


¹¹ In the case of formal power series the quadratic transformation $x' = x$, $y' = y/x$
leads from the ring $\mathfrak{k}\{x, y\}$ of formal power series in $x$ and $y$ to the larger ring
$\mathfrak{k}\{x', y'\}$ of formal power series in $x'$ and $y'$.




one primary ideal $\mathfrak{q}$ in $\mathfrak{O}$, and $\mathfrak{q}$ is the contracted ideal of $\mathfrak{q}^*$. In particular,
$\mathfrak{q}^*$ is simple or composite, according as $\mathfrak{q}$ is simple or composite.
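
The correspondence between $\mathfrak{q}$ and $\mathfrak{q}^*$ can be seen on the ideal $\mathfrak{q} = (y, x^2)$. The example is a sketch chosen here for illustration; $F_1$ is a hypothetical name for the single base polynomial of degree less than the exponent.

```latex
% q = (y, x^2) in O = k[x, y];  q^* = O^* q in O^* = k{x, y}.
\begin{align*}
\mathfrak{p}^{*2} = (x^2,\,xy,\,y^2) &\subseteq \mathfrak{q}^* \qquad(\rho = 2),\\
\mathfrak{q}^* &= (F_1,\; x^2,\; xy,\; y^2), \qquad F_1 = y \quad(\deg F_1 < \rho),\\
\mathfrak{q}^* \cap \mathfrak{O} &= (y,\,x^2) = \mathfrak{q}.
\end{align*}
% A polynomial lies in q^* exactly when its constant term and its coefficient of x vanish.
```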


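10. The notion of a general element of an ideal of formal power series.
The ring $\mathfrak{O}^*$ of holomorphic functions of $x$ and $y$ contains only one
0-dimensional prime ideal, namely the ideal $\mathfrak{p}^* = (x, y)$, and every 0-dimensional
ideal $\mathfrak{A}$ in $\mathfrak{O}^*$ is necessarily primary and belongs to $\mathfrak{p}^*$. Since any
ideal in $\mathfrak{O}^*$ has a finite base, given a 0-dimensional ideal $\mathfrak{A}$, it belongs to a
finite exponent $\rho$, i.e. $\mathfrak{p}^{*\rho} \equiv 0(\mathfrak{A})$. In other words, $\mathfrak{A}$ contains all the formal
power series whose terms of lowest degree are of degree $\geq \rho$. Since $\mathfrak{A}$ is at any rate
a linear $\mathfrak{k}$-module, it follows that the condition in order that an element
$\xi = \sum_{i,j \geq 0} a_{ij}x^iy^j$ of $\mathfrak{O}^*$ belong to $\mathfrak{A}$ is expressed by linear homogeneous relations
between the coefficients $a_{ij}$, $i + j < \rho$. With every linear relation

$$\sum_{i+j<\rho} c_{ij}a_{ij} = 0 \tag{17}$$

satisfied by all the elements $\xi$ of $\mathfrak{A}$, we associate the function

$$E = \sum c_{ij}x^{-i}y^{-j}. \tag{17'}$$

The set of all the functions $E$ obtained in this manner is called the "inverse
system" of the ideal $\mathfrak{A}$ and is denoted by $\mathfrak{A}^{-1}$ (Macaulay, Lasker). If $E_1, \dots, E_h$ belong
to $\mathfrak{A}^{-1}$, then also $c_1E_1 + \cdots + c_hE_h$ ($c_i \in \mathfrak{k}$) is in $\mathfrak{A}^{-1}$. From the fact
that if $\xi = \sum a_{ij}x^iy^j$ belongs to the ideal $\mathfrak{A}$, also
$x\xi = \sum a_{ij}x^{i+1}y^j$ and $y\xi = \sum a_{ij}x^iy^{j+1}$
belong to $\mathfrak{A}$, it follows immediately that if $E = \sum c_{ij}x^{-i}y^{-j}$ is an element of the
inverse system, then also the following relations are true for any element
$\xi = \sum a_{ij}x^iy^j$ in $\mathfrak{A}$:

$$\sum c_{i+1,j}a_{ij} = 0, \qquad \sum c_{i,j+1}a_{ij} = 0.$$

This shows that the inverse system $\mathfrak{A}^{-1}$ becomes an $\mathfrak{O}^*$-module, provided that
we define multiplication as follows:

$$x^\alpha y^\beta \cdot E = \sum_{i \geq \alpha,\ j \geq \beta} c_{ij}\,x^{-(i-\alpha)}y^{-(j-\beta)}, \qquad \alpha, \beta \geq 0.$$

In fact, with this definition of multiplication by power products $x^\alpha y^\beta$, the above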



equations $\sum c_{i+1,j}a_{ij} = 0$, $\sum c_{i,j+1}a_{ij} = 0$ signify that if $E$ is in $\mathfrak{A}^{-1}$, also $xE$ and
$yE$ are in $\mathfrak{A}^{-1}$. Multiplication of $E$ by any formal power series in $\mathfrak{O}^*$ is then
to be defined formally by the requirement of the distributive law of
multiplication. It is then not difficult to see that $\mathfrak{A}^{-1}$ consists of those and only
those functions $E$ which have the property that $\xi E = 0$ for any element $\xi$ in $\mathfrak{A}$.
It is evident that if $\mathfrak{A}$ and $\mathfrak{B}$ are ideals in $\mathfrak{O}^*$, then $\mathfrak{A}^{-1} = \mathfrak{B}^{-1}$ implies $\mathfrak{A} = \mathfrak{B}$.
Moreover, it is not difficult to show that given any set $\mathfrak{S}^{-1}$ of functions $E$ which
is an $\mathfrak{O}^*$-module and for which there exists at least one element $\xi \neq 0$ in $\mathfrak{O}^*$
such that $\xi E = 0$ for any $E$ in $\mathfrak{S}^{-1}$, then the set of all elements $\xi$ satisfying this
condition forms an ideal $\mathfrak{A}$, and $\mathfrak{S}^{-1}$ is the inverse system of $\mathfrak{A}$.
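
A minimal example of an inverse system, given here as a sketch: take $\mathfrak{A} = (y, x^2)$ in $\mathfrak{O}^*$. The computation assumes the multiplication rule in the form that terms with a positive exponent of $x$ or $y$ are dropped from a product $x^\alpha y^\beta \cdot E$; the names $E_1$, $E_2$ are introduced only for this illustration.

```latex
% xi = sum a_{ij} x^i y^j lies in A = (y, x^2) exactly when a_{00} = 0 and a_{10} = 0.
\begin{align*}
a_{00} = 0 \;&\longleftrightarrow\; E_1 = 1, \qquad
a_{10} = 0 \;\longleftrightarrow\; E_2 = x^{-1},\\
\mathfrak{A}^{-1} &= \{\, c_{00} + c_{10}\,x^{-1} \,\},\\
x \cdot E_2 &= E_1, \qquad x \cdot E_1 = 0, \qquad y \cdot E_1 = 0
  \qquad(\mathfrak{A}^{-1}\ \text{is an}\ \mathfrak{O}^*\text{-module}),\\
y \cdot E_2 &= 0, \qquad x^2 \cdot E_2 = 0
  \qquad(\text{the base}\ y,\ x^2\ \text{of}\ \mathfrak{A}\ \text{annihilates}\ \mathfrak{A}^{-1}),\\
x \cdot E_2 &= 1 \neq 0 \qquad(\text{in agreement with}\ x \notin \mathfrak{A}).
\end{align*}
```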

We now come to the definition of a concept which shall be very useful in
the sequel. Let $\tau = \sum t_{ij}x^iy^j$ be a formal power series whose coefficients $t_{ij}$
belong to an extension field $\mathfrak{K}$ of $\mathfrak{k}$. We shall say that $\tau$ is a variable element
of an ideal $\mathfrak{A}$, if the coefficients $t_{ij}$ do not all belong already to $\mathfrak{k}$ (so that at
least one of the $t_{ij}$'s is transcendental with respect to $\mathfrak{k}$) and if they satisfy
(in $\mathfrak{K}$) all the linear relations (17) (with $a_{ij}$ replaced by $t_{ij}$), which are
satisfied by the coefficients of all the actual power series belonging to $\mathfrak{A}$. In
other words, we require that $\tau E = 0$ for any function $E$ in $\mathfrak{A}^{-1}$. We shall say
that a variable element $\tau$ of $\mathfrak{A}$ is a general element of $\mathfrak{A}$, if the $t_{ij}$'s do not
satisfy any other algebraic relation, algebraically independent of the above
linear relations. Finally, we shall say that a variable element $\tau$ of $\mathfrak{A}$ is a
quasi-general element of $\mathfrak{A}$, if the $t_{ij}$'s do not satisfy linear relations other than
those which are satisfied by the coefficients of a general element of $\mathfrak{A}$.
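
The three notions can be told apart on the ideal $\mathfrak{A} = (y, x^2)$, whose elements are characterized by $a_{00} = a_{10} = 0$. The elements $\tau$ and $\tau_0$ below are hypothetical examples constructed for this purpose.

```latex
\begin{align*}
\tau &= t_{01}\,y + t_{20}\,x^2 + t_{11}\,xy + t_{02}\,y^2 + \cdots,
  \qquad t_{00} = t_{10} = 0,\\
&\quad\text{remaining } t_{ij} \text{ independent indeterminates over } \mathfrak{k}
  \;\Longrightarrow\; \tau \text{ is a general element of } \mathfrak{A};\\
\tau_0 &= t_{20}\,x^2 + t_{11}\,xy + t_{02}\,y^2 + \cdots,
  \qquad t_{00} = t_{10} = t_{01} = 0,\\
&\quad\tau_0 \text{ is a variable element of } \mathfrak{A}, \text{ but the extra linear relation }
  t_{01} = 0 \text{ shows that it is not quasi-general.}
\end{align*}
```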

In a similar manner we define the notions of a variable element, of a
general element and of a quasi-general element of the inverse system $\mathfrak{A}^{-1}$.
The condition that a function $E = \sum c_{ij}x^{-i}y^{-j}$ belong to $\mathfrak{A}^{-1}$ is expressed by a
certain set (S) of homogeneous linear equations between the $c_{ij}$'s. These
equations are obtained by expressing the fact that $E\xi_i = 0$, $i = 1, 2, \dots, h$,
where $\xi_1, \dots, \xi_h$ is a base of the ideal $\mathfrak{A}$. A function $E = \sum e_{ij}x^{-i}y^{-j}$, where
the $e_{ij}$ are elements of an extension field $\mathfrak{k}(e_{ij})$ of $\mathfrak{k}$, shall be called a variable
element of $\mathfrak{A}^{-1}$, if the coefficients $e_{ij}$ satisfy all the relations of the set (S),
i.e. if $E\xi = 0$ for any $\xi$ in $\mathfrak{A}$. If the coefficients $e_{ij}$ do not satisfy algebraic
(or linear) relations algebraically independent of the linear relations (S),
then $E$ shall be called a general (or quasi-general) element of $\mathfrak{A}^{-1}$.

Let $E = \sum e_{ij}x^{-i}y^{-j}$ and $\tau = \sum t_{ij}x^iy^j$ be variable elements of $\mathfrak{A}^{-1}$ and of $\mathfrak{A}$
respectively, where we assume that the $e_{ij}$ and $t_{ij}$ are elements of one and the
same field $\mathfrak{K}$.

We have $\xi E = 0$ for any element $\xi = \sum a_{ij}x^iy^j$ of $\mathfrak{A}$. Since the $a_{ij}$'s are
arbitrary elements of the underlying field $\mathfrak{k}$ subject to the only condition of
satisfying a finite set of linear relations $\sum c_{ij}a_{ij} = 0$, where $\sum c_{ij}x^{-i}y^{-j}$ is an element
of $\mathfrak{A}^{-1}$, it follows that the relation $\xi E = 0$ remains true if we regard the $a_{ij}$ as
indeterminates connected by the linear relations $\sum c_{ij}a_{ij} = 0$. But then the
relation $\xi E = 0$ holds also after the specialization $a_{ij} \to t_{ij}$, since the $t_{ij}$ also
satisfy all the relations $\sum c_{ij}t_{ij} = 0$. We conclude that $\tau E = 0$. The following
assertion is now easily derived:

If $E$ is a quasi-general element of $\mathfrak{A}^{-1}$, then $\tau E = 0$ implies that $\tau$ is a
variable element of $\mathfrak{A}$, and if $\tau$ is a quasi-general element of $\mathfrak{A}$ then $\tau E = 0$
implies that $E$ is a variable element of $\mathfrak{A}^{-1}$. Proof straightforward.

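Let $\mathfrak{A}'$, $\mathfrak{A}''$ be two (distinct or coincident) ideals in $\mathfrak{O}^*$ and let $\tau' = \sum t'_{ij}x^iy^j$,
$\tau'' = \sum t''_{ij}x^iy^j$ be variable elements of $\mathfrak{A}'$ and $\mathfrak{A}''$ respectively. Let $\mathfrak{K}' = \mathfrak{k}(t'_{ij})$
and $\mathfrak{K}'' = \mathfrak{k}(t''_{ij})$. We consider a pure transcendental base $\{u\}$ of $\mathfrak{K}''$ over $\mathfrak{k}$
and we adjoin the elements of this base to the field $\mathfrak{K}'$, the adjunction to be
regarded as a pure transcendental extension of $\mathfrak{K}'$. We obtain in this manner
the field $\mathfrak{K}'(\{u\})$ having $\mathfrak{k}(\{u\})$ as subfield, and we then adjoin to $\mathfrak{K}'(\{u\})$
all the elements of $\mathfrak{K}''$ which are algebraic with respect to $\mathfrak{k}(\{u\})$. In the
field $\mathfrak{K} = \mathfrak{k}(t'_{ij}, t''_{ij})$ obtained in this manner, any algebraic relation between
the $t'_{ij}$ and the $t''_{ij}$ must be a consequence of algebraic relations between the
$t'_{ij}$'s alone in $\mathfrak{K}'$ and the $t''_{ij}$'s alone in $\mathfrak{K}''$. Now that the $t'_{ij}$ and the $t''_{ij}$ have
been properly imbedded in a common field, we form the product $\tau'\tau'' = \tau
= \sum t_{ij}x^iy^j$, $t_{ij} \in \mathfrak{K}$, and we refer to $\tau$ as the direct product of the variable
elements $\tau'$, $\tau''$ of $\mathfrak{A}'$ and $\mathfrak{A}''$.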

THEOREM 10.1. The direct product $\tau = \tau'\tau''$ of variable elements of two
ideals $\mathfrak{A}'$, $\mathfrak{A}''$ is a variable element of the product $\mathfrak{A}'\mathfrak{A}''$ of the two ideals.
If $\tau'$, $\tau''$ are general or quasi-general elements of $\mathfrak{A}'$ and $\mathfrak{A}''$ respectively,
then $\tau$ is a quasi-general (not necessarily general) element of $\mathfrak{A}'\mathfrak{A}''$.
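
The first assertion of the theorem can be checked on a small case. The sketch below takes $\mathfrak{A}' = \mathfrak{p}^* = (x, y)$ and $\mathfrak{A}'' = (y, x^2)$, a choice made here for illustration, and verifies that the lowest coefficients of the product satisfy the linear relations of $\mathfrak{A}'\mathfrak{A}''$ and are bilinear in the $t'_{ij}$ and $t''_{ij}$.

```latex
\begin{align*}
\tau' &= t'_{10}\,x + t'_{01}\,y + \cdots, \qquad
\tau'' = t''_{01}\,y + t''_{20}\,x^2 + t''_{11}\,xy + \cdots,\\
\mathfrak{A}'\mathfrak{A}'' &= (xy,\; y^2,\; x^3):\qquad
  \xi \in \mathfrak{A}'\mathfrak{A}'' \iff a_{00} = a_{10} = a_{01} = a_{20} = 0,\\
\tau = \tau'\tau'' &= t'_{10}t''_{01}\,xy + t'_{01}t''_{01}\,y^2 + t'_{10}t''_{20}\,x^3 + \cdots,\\
t_{00} &= t_{10} = t_{01} = t_{20} = 0, \qquad
t_{11} = t'_{10}t''_{01},\quad t_{02} = t'_{01}t''_{01},\quad t_{30} = t'_{10}t''_{20}.
\end{align*}
```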


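Proof. Let $\mathfrak{A}'\mathfrak{A}'' = \mathfrak{B}$. It is well known (and is of immediate verification)
that $\mathfrak{B}^{-1} = \mathfrak{A}'^{-1} : \mathfrak{A}''$, i.e. $\mathfrak{B}^{-1}$ consists of all functions $E$ such that
$E\mathfrak{A}'' \subseteq \mathfrak{A}'^{-1}$. Let then $E_0 = \sum c^{(0)}_{ij}x^{-i}y^{-j}$ be an element in $\mathfrak{B}^{-1}$ and let
$\tau''E_0 = \sum \sigma_{ij}x^{-i}y^{-j}$. We assert that $\tau''E_0$ is a variable element of $\mathfrak{A}'^{-1}$. In
fact, consider any element $\xi'' = \sum a''_{ij}x^iy^j$ of $\mathfrak{A}''$. Since $E_0\mathfrak{A}'' \subseteq \mathfrak{A}'^{-1}$, the
product $\xi''E_0 = \sum \sigma^{(0)}_{ij}x^{-i}y^{-j}$ is an element of the inverse system $\mathfrak{A}'^{-1}$. Hence
the coefficients $\sigma^{(0)}_{ij}$ must satisfy the linear relations (S') which are satisfied
by the coefficients of the general elements of $\mathfrak{A}'^{-1}$. The coefficients $\sigma^{(0)}_{ij}$ being
linear forms in the coefficients $a''_{ij}$ of $\xi''$, we get by substitution into the
equations (S') a set of linear homogeneous relations between the $a''_{ij}$. Since
these relations hold true for the coefficients $a''_{ij}$ of any element $\xi''$ of $\mathfrak{A}''$,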


182 OSCAR ZARISKI. 


it follows that they are also satisfied by the coefficients (’%; of the variable 
element 7” of YW”. But then it follows that the coefficients o;; of 7H must 
satisfy the linear equations (8”), whence 7”H, is a variable element of /~, 
Since 7’ is a variable element of %’, it follows by a previously proved result, 
that 7’r”H, =0, i.e. Since the relation r#, holds for any 
element HZ, of 8, it follows that r is a variable element of 8, and this proves 
the first part of the theorem. 

Now suppose that $\tau'$ and $\tau''$ are general (or quasi-general) elements of the
ideals $\mathfrak{A}'$ and $\mathfrak{A}''$ respectively. To prove that in this case $\tau$ is a quasi-general
element of $\mathfrak{B}$ we have to show that if $\sum c_{ij}t_{ij} = 0$ is any linear homogeneous
relation between the $t_{ij}$, then $E = \sum c_{ij}x^{-i}y^{-j}$ belongs to the inverse system
$\mathfrak{B}^{-1}$. Now the $t_{ij}$ are bilinear forms in the two sets of coefficients $t'_{ij}$ and
$t''_{ij}$. Substituting these bilinear forms we get $\sum c_{ij}t_{ij} = H(t'_{ij}, t''_{ij})$, where $H$
is a bilinear form in the $t'_{ij}$ and the $t''_{ij}$. The relation $H(t'_{ij}, t''_{ij}) = 0$ must be
a consequence of the linear relations between the $t'_{ij}$ and the $t''_{ij}$ separately,
since $\tau$ is the direct product of $\tau'$ and $\tau''$. Now $\tau'$ is a quasi-general element
of $\mathfrak{A}'$. As a consequence, any linear relation between the $t'_{ij}$ arises from a
function $E'$ in $\mathfrak{A}'^{-1}$ and is therefore not destroyed if each $t'_{ij}$ is replaced by
$t'_{i-1,j}$ or by $t'_{i,j-1}$ (multiplication of $E'$ by $x$ or by $y$ respectively). Hence the
relation $H(t'_{ij}, t''_{ij}) = 0$ implies the relations $H(t'_{i-1,j}, t''_{ij}) = 0$ and
$H(t'_{i,j-1}, t''_{ij}) = 0$. It is clear that these two relations correspond to the
relations $\sum c_{ij}t_{i-1,j} = 0$ and $\sum c_{ij}t_{i,j-1} = 0$, which therefore must be true
relations between the $t_{ij}$. As a consequence the functions $E = \sum c_{ij}x^{-i}y^{-j}$
corresponding to the various linear relations $\sum c_{ij}t_{ij} = 0$ between the $t_{ij}$ form an
$\mathfrak{O}^*$-module. From this it follows immediately that $E\tau = 0$, for any $E$ in this
module. Now $E\tau = E\tau'\tau'' = 0$ implies that $E\tau'$ is a variable element of $\mathfrak{A}''^{-1}$,
since $\tau''$ is a quasi-general element of $\mathfrak{A}''$. The condition of belonging to $\mathfrak{A}''^{-1}$
is expressed by a certain set (8'') of linear equations, and thus the coefficients
of $E\tau'$ must satisfy these equations. Since $\tau'$ is a quasi-general element of $\mathfrak{A}'$,
these linear equations (8'') must be satisfied by $E\xi'$, $\xi'$ an arbitrary element
of $\mathfrak{A}'$, and hence $E\mathfrak{A}' \subseteq \mathfrak{A}''^{-1}$. As a consequence $E \in \mathfrak{B}^{-1}$, which
proves our theorem.$^{12}$


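12 That $\tau$ need not be the general element of $\mathfrak{A}'\mathfrak{A}''$ is shown by the following
example. Let $\mathfrak{p} = (x, y)$, $\mathfrak{q} = (x^2, y^2)$, so that $\mathfrak{p}\mathfrak{q} = \mathfrak{p}^3$. The general element $\tau'$ of
$\mathfrak{p}$ is $t'_{10}x + t'_{01}y + \cdots$, where the $t'_{ij}$ are indeterminates. Similarly
$\tau'' = t''_{20}x^2 + t''_{02}y^2 + t''_{30}x^3 + \cdots$
is the general element of $\mathfrak{q}$. Now
$\tau'\tau'' = t_{30}x^3 + t_{21}x^2y + t_{12}xy^2 + t_{03}y^3 + \cdots$,
where the $t_{ij}$ satisfy one non-linear relation $t_{30}t_{03} = t_{21}t_{12}$.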
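
The non-linear relation in footnote 12 can be checked numerically. The sketch below is only an illustration, under the assumption that the footnote's ideals are $(x, y)$ and $(x^2, y^2)$ (our reading of the damaged text). It keeps only the lowest-degree terms of the two general elements, since the cubic part of the product depends on nothing else. The function names are hypothetical.

```python
def multiply(p, q):
    """Multiply two polynomials in x, y stored as {(i, j): coefficient}."""
    out = {}
    for (a, b), u in p.items():
        for (c, d), v in q.items():
            key = (a + c, b + d)
            out[key] = out.get(key, 0) + u * v
    return out


def cubic_part_of_product(t1_10, t1_01, t2_20, t2_02):
    """Degree-3 coefficients of (t'10 x + t'01 y)(t''20 x^2 + t''02 y^2).

    Higher-order terms of both factors are dropped on purpose: they cannot
    contribute to the cubic form of the product.
    """
    tau1 = {(1, 0): t1_10, (0, 1): t1_01}
    tau2 = {(2, 0): t2_20, (0, 2): t2_02}
    tau = multiply(tau1, tau2)
    return {k: v for k, v in tau.items() if k[0] + k[1] == 3}


t = cubic_part_of_product(2, 3, 5, 7)
# The four cubic coefficients are products of the input coefficients,
# so t30 * t03 and t21 * t12 agree for every choice of inputs.
print(t[(3, 0)] * t[(0, 3)] == t[(2, 1)] * t[(1, 2)])
```

The relation holds identically because both sides equal the product of all four input coefficients; no linear relation among the four cubic coefficients is implied.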




The preceding theorem implies that if an ideal $\mathfrak{B}$ is composite, then a
suitable quasi-general element $\tau$ of $\mathfrak{B}$ (namely the direct product of the general
elements of the factors of $\mathfrak{B}$) is reducible in the algebraic closure of the field
of the coefficients $t_{ij}$ of $\tau$. We are interested in the question of the extent to
which the above considerations can be inverted. What can be said about an
ideal $\mathfrak{B}$, if its general element $\tau$ is reducible in the algebraic closure of the
coefficients of $\tau$? That the ideal $\mathfrak{B}$ need not in general be composite is
illustrated by the example $\mathfrak{B} = (x^2, y^2)$. The ideal $\mathfrak{B}$ is simple, but its general
element

$\tau = t_{20}x^2 + t_{02}y^2 + t_{30}x^3 + \cdots \quad (t_{00} = t_{10} = t_{01} = t_{11} = 0)$

is reducible, since

$\tau = (\sqrt{t_{20}}\,x + \sqrt{-t_{02}}\,y + \cdots)(\sqrt{t_{20}}\,x - \sqrt{-t_{02}}\,y + \cdots).$
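
As a small check of the factorization just displayed, the following sketch verifies only its lowest-degree part: the quadratic form $t_{20}x^2 + t_{02}y^2$ splits into two linear forms once square roots of $t_{20}$ and $-t_{02}$ are available. It assumes the example ideal is $(x^2, y^2)$ (our reading of the source), uses complex numbers in place of the algebraic closure, and does not compute the higher-order terms of the factors. The function names are hypothetical.

```python
import cmath


def leading_factors(t20, t02):
    """Return the two linear forms (a, b) and (a, -b), meaning a*x + b*y
    and a*x - b*y, with a = sqrt(t20) and b = sqrt(-t02)."""
    a = cmath.sqrt(t20)
    b = cmath.sqrt(-t02)
    return (a, b), (a, -b)


def expand(f, g):
    """Coefficients of x^2, xy, y^2 in the product of two linear forms."""
    (a1, b1), (a2, b2) = f, g
    return a1 * a2, a1 * b2 + b1 * a2, b1 * b2


f, g = leading_factors(3, 5)
x2, xy, y2 = expand(f, g)
# Up to rounding, the product is 3*x^2 + 0*xy + 5*y^2.
print(abs(x2 - 3) < 1e-12, abs(xy) < 1e-12, abs(y2 - 5) < 1e-12)
```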


However, in the special case of valuation ideals we can prove the following
theorem:

THEOREM 10.2. If $\mathfrak{A}$ is a valuation ideal and if the general element $t$
of $\mathfrak{A}$ is reducible, $t = t_1t_2 \cdots t_h$ (in the algebraic closure of its coefficients),
then $\mathfrak{A}$ is composite; $t$ is the direct product of its irreducible factors $t_i$, each
irreducible factor $t_i$ is the general element of a valuation ideal $\mathfrak{A}_i$, and
$\mathfrak{A} = \mathfrak{A}_1\mathfrak{A}_2 \cdots \mathfrak{A}_h$.


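Proof. Let $t = \sum t_{ij}x^iy^j$ and let $t = t_1t_2 \cdots t_h$ be the factorization of $t$
into irreducible factors $t_i = t_i(x, y)$, belonging to the ring of formal power
series of $x$, $y$ with coefficients in the algebraic closure of $\mathfrak{k}(\cdots t_{ij} \cdots)$. We
assume of course that no $t_i$ is a unit, i.e. that $t_i$ does not contain a term
independent of $x$ and $y$. The valuation $B$ to which $\mathfrak{A}$ belongs can be assumed
to be algebraic. Substituting into $t$ and into $t_i$ the Puiseux expansion $y = P(x)$
which determines the valuation $B$, we are able to attach to these elements
definite values $v(t)$, $v(t_i)$. It is clear that $v(t) = v(\mathfrak{A})$. Let $v(t_i) = \alpha_i$,
whence $v(t) = \alpha_1 + \alpha_2 + \cdots + \alpha_h$. There exist elements in $\mathfrak{k}\{x, y\}$ whose
value in $B$ equals $\alpha_i$: they can be obtained by specializing the coefficients of
the formal power series $t_i$ in such a manner as not to annihilate the coefficient
of the leading term $x^{\alpha_i}$ of $t_i(x, P(x))$. As a consequence there exists a v-ideal
$\mathfrak{A}_i$ for $B$ such that $v(\mathfrak{A}_i) = \alpha_i$. Since $\alpha_i > 0$, no $\mathfrak{A}_i$ is the unit ideal. Since
$v(t_i) = v(\mathfrak{A}_i)$, it follows that $t_i$ is a variable element of $\mathfrak{A}_i$, and hence $t = \prod t_i$
is a variable element of the ideal $\mathfrak{A}_1\mathfrak{A}_2 \cdots \mathfrak{A}_h$. But since $t$ is a general
element of $\mathfrak{A}$, necessarily $\mathfrak{A} \equiv 0(\mathfrak{A}_1\mathfrak{A}_2 \cdots \mathfrak{A}_h)$. On the other hand we have

$v(\mathfrak{A}) = v(t) = \sum v(t_i) = \sum v(\mathfrak{A}_i) = v(\mathfrak{A}_1\mathfrak{A}_2 \cdots \mathfrak{A}_h).$

Hence $\prod \mathfrak{A}_i \equiv 0(\mathfrak{A})$, and consequently $\mathfrak{A} = \mathfrak{A}_1\mathfrak{A}_2 \cdots \mathfrak{A}_h$.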

Let $\tau_i$ be the general element of $\mathfrak{A}_i$ and let us consider the direct product
$\tau = \tau_1\tau_2 \cdots \tau_h$. By Theorem 10.1, $\tau$ is a variable element of $\mathfrak{A}$. On the
other hand, any algebraic relation between the coefficients of the power series $\tau$
leads also to a true relation between the coefficients of $t$, since $t$ is obtained
from $\tau$ by the specialization $\tau_i \to t_i$. Consequently $\tau$ must be a general element
of $\mathfrak{A}$. It can be then identified with $t$, and it is thus seen that in the original
factorization $t = \prod t_i$, each $t_i$ is a general element of $\mathfrak{A}_i$ and that the product
is a direct product (unicity of factorization of $t$ into irreducible factors). The
theorem is proved.
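
The proof above attaches a value to a power series by substituting the Puiseux expansion $y = P(x)$ of the valuation. A minimal sketch of that computation for polynomials follows. It assumes the branch is given in parametric form $x = s^n$, $y = \sum_k c_k s^k$ with finitely many terms (a hypothetical, truncated example, not the general case treated in the text), and it returns the order in $s$, which is $n$ times the value normalized by $v(x) = 1$. All names are illustrative.

```python
def poly_mul(p, q):
    """Product of two univariate polynomials given as coefficient lists."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def poly_pow(p, e):
    """e-th power of a univariate polynomial (e >= 0)."""
    out = [1]
    for _ in range(e):
        out = poly_mul(out, p)
    return out


def order_along_branch(f, n, y_series):
    """Order in s of f(s**n, y(s)), where f = {(i, j): coeff} stands for
    sum coeff * x**i * y**j and y(s) = sum y_series[k] * s**k.
    Returns None when f vanishes identically on the branch."""
    total = {}
    for (i, j), c in f.items():
        for k, a in enumerate(poly_pow(y_series, j)):
            e = n * i + k
            total[e] = total.get(e, 0) + c * a
    orders = [e for e in sorted(total) if total[e] != 0]
    return orders[0] if orders else None


# Branch y = x^(3/2), written as x = s^2, y = s^3.
branch = [0, 0, 0, 1]
print(order_along_branch({(1, 0): 1}, 2, branch))   # order of x
print(order_along_branch({(0, 1): 1}, 2, branch))   # order of y
print(order_along_branch({(1, 1): 1}, 2, branch))   # order of x*y
```

On this branch the order of a product is the sum of the orders of its factors, which is the additivity used in the proof when $v(t)$ is written as the sum of the $v(t_i)$.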


COROLLARY. The general element of a simple v-ideal is absolutely irreducible
(i.e. irreducible in $\mathfrak{K}\{x, y\}$, $\mathfrak{K}$ being any extension field of $\mathfrak{k}$).


11. The characterization of simple v-ideals. Let


k 

4=1 j=0 
be a Puiseux series, in which we assume that $(\nu, \alpha_1, \dots, \alpha_g) = 1$, $g \le k + 1$,
and that the first $k$ coefficients $c_1, \dots, c_k$ are in the underlying field $\mathfrak{k}$, while
the remaining coefficients are indeterminates.


THEOREM 11.1. Given a formal power series $\xi(x, y) = \sum a_{ij}x^iy^j$ with
indeterminate coefficients $a_{ij}$, there exists a set of linear forms $F_m(a_{ij})$, $G_n(a_{ij})$
which have the following properties: (1) the relations $F_m(a^0_{ij}) = 0$ $(a^0_{ij} \in \mathfrak{k})$
give necessary conditions that the equation $\xi_0(x, y) = \sum a^0_{ij}x^iy^j = 0$ admit a
uniformization of type (18), the $t_j$'s being replaced by special values $t^0_j$ in $\mathfrak{k}$;
(2) the relations $F_m(a^0_{ij}) = 0$ and the inequalities $G_n(a^0_{ij}) \ne 0$ give a
necessary and sufficient condition in order that $\xi_0(x, y) = 0$ admit the above
uniformization and that $\xi_0(x, y)$ be an irreducible element of $\mathfrak{k}\{x, y\}$. The
set of all elements $\xi$ in $\mathfrak{k}\{x, y\}$ whose coefficients $a_{ij}$ satisfy the relations $F_m = 0$
is a simple ideal.


Proof. The theorem is true for $g = 0$, in which case $\nu = 1$. In fact, let




j= —0, if k =0). If the equation €(z, y) —0 is uniformizable 


i=1 
by the expansion $y = y_1$, then we must have $\xi(x, y_1) = 0$, and this shows that
$\xi(x, y)$ is divisible by $y - y_1$. This condition is expressed by a certain set of
linear homogeneous equations $F_m(a_{ij}) = 0$ between the coefficients of $\xi(x, y)$.
Should moreover $\xi(x, y)$ be irreducible, then we must have

$\xi(x, y) = (y - y_1)\,\eta(x, y),$

where $\eta(x, y)$ is a unit. As a consequence, the coefficient $a_{01}$ must be different
from 0. Conversely, if $F_m(a_{ij}) = 0$ and $a_{01} \ne 0$, then $\xi(x, y)$ is divisible by
$y - y_1$ and is irreducible in $\mathfrak{k}\{x, y\}$.

We assume that the theorem is true for g—1. Let vy—yn, = 
(n,@’,) =1. We put 
(19) y= + 9). 


If $\xi(x, y) = \sum a_{ij}x^iy^j$ can be uniformized by the expansion $y = y_1$, then $\xi$ must
be divisible by the product $\prod_{i=1}^{\nu}(y - y_i)$, where $y_2, \dots, y_\nu$ are
the conjugates of $y_1$. From this it follows immediately that $\xi$ cannot contain
terms $x^iy^j$ in which $\nu i + \alpha_1 j < \nu\alpha_1$, whence

(20) $a_{ij} = 0$, for all $i, j$ such that $\nu i + \alpha_1 j < \nu\alpha_1$.
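
Condition (20) says that the coefficients lying strictly below the line $\nu i + \alpha_1 j = \nu\alpha_1$ must vanish. The following sketch lists those index pairs and tests a polynomial against them. It assumes our reading of (20) as the inequality $\nu i + \alpha_1 j < \nu\alpha_1$; the function names are hypothetical.

```python
def forced_zero_indices(nu, alpha1):
    """Index pairs (i, j) with nu*i + alpha1*j < nu*alpha1.

    Only i < alpha1 and j < nu can satisfy the inequality, so the
    search ranges are finite.
    """
    return [(i, j)
            for j in range(nu)
            for i in range(alpha1)
            if nu * i + alpha1 * j < nu * alpha1]


def satisfies_condition_20(f, nu, alpha1):
    """True when every coefficient of f = {(i, j): coeff} at a forced-zero
    index is zero (missing keys count as zero)."""
    return all(f.get(idx, 0) == 0 for idx in forced_zero_indices(nu, alpha1))


# nu = 2, alpha1 = 3 corresponds to a leading exponent 3/2.
print(forced_zero_indices(2, 3))
print(satisfies_condition_20({(0, 2): 1, (3, 0): -1}, 2, 3))  # y^2 - x^3
print(satisfies_condition_20({(1, 1): 1, (0, 2): 1}, 2, 3))   # xy + y^2
```

The cusp $y^2 - x^3$ passes because both of its monomials lie on the line itself, while $xy + y^2$ fails because the index $(1, 1)$ lies below it.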
Substituting (19) we find, in view of (20), 
(20’) E(x, y) = 9) 


while the equation of the branch (18) becomes 


k oo 

4=3 j=0 
The equation é(Z, 7) — 0 admits a uniformization by means of an expansion 
¥=%, of type (21). Since a >a, there can be no constant term in 
§(Z,#). This constant term arises from the terms of -é(a, y) in which 
W+ = le. ni = na’,v,, and is therefore equal to 


Consequently 


(22) Ay,a'1,0 + ote == (), 


It is clear that (a — a, —%4,° *,%—,,v,) = 1, hence we are in the 




case $g - 1$. By our induction, we have a set of linear forms $\bar{F}(\bar{a}_{ij})$ and
$\bar{G}(\bar{a}_{ij})$ which satisfy the assertion of the theorem. Now the coefficients $\bar{a}_{ij}$ of
$\bar{\xi}(\bar{x}, \bar{y})$ are linear homogeneous forms in the coefficients $a_{ij}$ of $\xi(x, y)$. Let
$\{F_m\}$ be the set of forms in the $a_{ij}$ consisting of the forms $\bar{F}(\bar{a}_{ij})$, expressed in
terms of the $a_{ij}$, and of the left-hand members of the equations (20) and (22).
Let, moreover, $\{G_n\}$ be the set of forms in the $a_{ij}$ consisting of the forms
$\bar{G}(\bar{a}_{ij})$ (expressed in terms of the $a_{ij}$) and of the form $a_{0\nu}$. We assert that the
forms $\{F_m\}$ and $\{G_n\}$ satisfy the assertion of our theorem. In the first place, by
the definition of the forms $F_m$, the equations $F_m = 0$ must be satisfied if $\xi(x, y)$
can be uniformized by an expansion $y = y_1$ of the type (18). Assume that the
coefficients $a_{ij}$ satisfy the equations $F_m = 0$ and the inequalities $G_n \ne 0$. The
validity of the equations (20) implies that the substitution (19) introduces in
$\xi(x, y)$ a factor $\bar{x}^{\lambda}$, where $\lambda = \nu\alpha'_1$. Hence in (20') the factor $\bar{\xi}(\bar{x}, \bar{y})$ contains
no negative powers of $\bar{x}$ or of $\bar{y}$. The coefficients $\bar{a}_{ij}$ of $\bar{\xi}$ satisfy, by hypothesis,
the equations $\bar{F}(\bar{a}_{ij}) = 0$ and the inequalities $\bar{G}(\bar{a}_{ij}) \ne 0$ relative to the
branch (21). Hence the equation $\bar{\xi}(\bar{x}, \bar{y}) = 0$ can be uniformized by an
expansion $\bar{y} = \bar{y}_1$ of type (21), and consequently also the equation $\xi(x, y) = 0$
can be uniformized by an expansion $y = y_1$ of type (18). The power series
$\xi(x, y)$ must be divisible by the irreducible element $\prod_{j=1}^{\nu}(y - y_j)$. Let
$\xi = \eta(x, y)\prod_{j=1}^{\nu}(y - y_j)$. But, by assumption, $a_{0\nu} \ne 0$, i.e. $\xi(x, y)$
contains a term in $y^{\nu}$. Consequently $\eta(x, y)$ must contain a constant term $\ne 0$,
since the coefficient of any term $y^j$, $j < \nu$, in the product $\prod_{j=1}^{\nu}(y - y_j)$ is
divisible by $x$. Hence $\eta$ is a unit, and $\xi$ is irreducible.

Conversely, if $\xi = 0$ admits a uniformization $y = y_1$ of type (18) and if
$\xi$ is irreducible, we will have $\xi = \eta(x, y)\prod_{j=1}^{\nu}(y - y_j)$, where $\eta$ is a unit.
This implies in the first place $a_{0\nu} \ne 0$, and it also implies, as was pointed out
before, that the equations $F_m(a_{ij}) = 0$ hold true. The inequalities $G_n(a_{ij}) \ne 0$
arising from the inequalities $\bar{G}(\bar{a}_{ij}) \ne 0$ must also be satisfied, since the
hypothesis that $\xi$ is irreducible in $\mathfrak{k}\{x, y\}$ implies that $\bar{\xi}(\bar{x}, \bar{y})$ is irreducible
in $\mathfrak{k}\{\bar{x}, \bar{y}\}$.

It remains to prove that the elements $\xi(x, y) = \sum a_{ij}x^iy^j$ whose coefficients
satisfy the linear relations $F_m = 0$ form a simple ideal $\mathfrak{P}$. Let us regard the
coefficients $a_{ij}$ as elements of the field defined by the equations $F_m(a_{ij}) = 0$.
The inequalities $G_n \ne 0$ are then satisfied, and $\xi = 0$ admits a uniformization
$y = y_1$ of type (18). As a consequence also $x\xi = 0$ and $y\xi = 0$ admit the
uniformization $y = y_1$, and hence the coefficients of the two power series $x\xi$
and $y\xi$ must satisfy the relations $F_m = 0$. This implies that the relations
$F_m(a_{i-1,j}) = 0$, $F_m(a_{i,j-1}) = 0$ are consequences of the relations $F_m(a_{ij}) = 0$,
i.e. if $\xi$ is any element in $\mathfrak{P}$, then $x\xi \in \mathfrak{P}$ and $y\xi \in \mathfrak{P}$. Hence $\mathfrak{P}$ is an ideal.
To prove that $\mathfrak{P}$ is a simple ideal, we observe that if $\mathfrak{P}$ were a composite ideal,
then, by Theorem 10.1, a suitable quasi-general element $t$ of $\mathfrak{P}$ would be
reducible in the algebraic closure of the field of the coefficients of $t$. Now $t$,
a quasi-general element of $\mathfrak{P}$, has the property that its coefficients satisfy no
linear relations other than those which hold for the coefficients of the general
element of $\mathfrak{P}$, i.e. only the relations $F_m = 0$. Hence the coefficients of $t$
certainly satisfy the inequalities $G_n \ne 0$, and therefore $t$ could not be reducible,
in contradiction with our assumption that $\mathfrak{P}$ is composite, q. e. d.

In order to apply the above theorem, we begin with some preliminary
remarks. Let $t = \sum t_{ij}x^iy^j$ be the general element of a valuation ideal $\mathfrak{q}$, and
let us assume that $t$ is absolutely irreducible (this is certainly the case if
$\mathfrak{q}$ is a simple v-ideal, see Theorem 10.2, Corollary). We will have then
$t = \varepsilon \cdot \prod_{i=1}^{\nu}(y - y_i)$, where $\varepsilon$ is a unit and where $y_1, y_2, \dots, y_\nu$ are the
determinations of a Puiseux series $y_1 = \sum_{i=1}^{\infty} t_i x^{i/\nu}$, the $t_i$'s being algebraic
functions of the $t_{ij}$. Some of the coefficients $t_i$ may be constants, i.e. elements in $\mathfrak{k}$.
Let $t_i = c_i$, $i = 1, 2, \dots, h$, $c_i \in \mathfrak{k}$, while $t_{h+1}$ is the first coefficient which is
transcendental with respect to $\mathfrak{k}$. The ideal $\mathfrak{q}$ is a valuation ideal for some
algebraic valuation $B$, defined by an expansion $y = \bar{y} = \sum_{i=1}^{\infty} d_i x^{i/\mu}$, $d_i \in \mathfrak{k}$, and
the value of $t$, i.e. the evaluation of $\mathfrak{q}$, is the exponent of the term of lowest
degree in $t(x, \bar{y}) = \varepsilon \cdot \prod_{i=1}^{\nu}(\bar{y} - y_i)$. It is clear that the term of lowest degree in

7— yi is the same as the term of lowest degree in — 
It follows that the value of ¢ is not altered if we regard the coefficients 


in the expansion as entirely independent indeterminates. Now 
if $\bar{t}(x, y)$ denotes the direct product $\bar{\varepsilon} \cdot \prod_{i=1}^{\nu}(y - \bar{y}_i)$, in which $t_{h+1}, t_{h+2}, \dots$ are
regarded as indeterminates and $\bar{\varepsilon}$ is a unit with indeterminate coefficients (i.e.
$\bar{\varepsilon}$ is the general element of the unit ideal), then $\bar{t}$ is a variable element of $\mathfrak{q}$,
since $v(\bar{t}) = v(\mathfrak{q})$, and on the other hand $\bar{t}$ is at least as general a power series
as $t$ (i.e. $t$ is a specialization of $\bar{t}$). Since $t$ is the general element of $\mathfrak{q}$, it
follows that $t$ can be identified with $\bar{t}$, and hence the coefficients $t_{h+1}, t_{h+2}, \dots$
in the original Puiseux series $y_1$ are indeed algebraically independent with
respect to $\mathfrak{k}$, and can be regarded as indeterminates. Changing slightly our
notation and putting into evidence the coefficients $c_i$ which are different from
zero, we re-write the series $y_1$ as follows:


where to, t,, t2, are indeterminates. Let 8 be the highest common divisor of 


v 
We assert that if 8>1, then t—eJ] (y—yi) cannot 
4=1 


be the general element of an ideal. We shall show, namely, that if $\delta > 1$, then
the coefficients $t_{ij}$ of $t = \sum t_{ij}x^iy^j$ satisfy non-linear relations (which are not
consequences of linear relations). Consider another sample of the series (23),


k oo 
y's anes > + > 
i=1 
Vv 
where the ¢’; are new indeterminates, and let = > = (y— y's), 


where $\varepsilon'$ is a unit with indeterminate coefficients. Also $t'$ is a general element
of $\mathfrak{q}$. Assume that the coefficients $t_{ij}$ (and hence also the coefficients $t'_{ij}$)
satisfy only linear homogeneous relations. Then it is clear that $t + t'$ is also


a general element of q, whence ¢ + ¢’ = (y—- Yi), where € is a unit and 
The substitution y = y, must annihilate t+ (’, i.e. II (y—yi) +2 Il (y—y'i). 
We proceed to express the fact that the en of the fit: to terms 
of Il +e II (¥:— vanish. If denotes a primitive v-th root 
of ditty, then 
j= 


j=l 
whence 


k 
j=1 


Let & be the h.c.d. of +, whence 0(8). We denote by 
II’(7, — yi) the product of the factors 9, — y; for which i8’ = 0(v), and 
by II”(9Y,— yi) the product of the remaining factors 47, — y:, so that 


Il’ — Il’ (4%: — (J: yi). 


If is’ 4 0(v), the differences 1—w*%, 7 =1,2,---,k, cannot all vanish. 




Let o be the smallest value of j, such that 1—wi*e~0. Then, by (24), 
ca(1 — w'*7)x%0/”, g =k, is the term of smallest degree in 9, — yi, while the 
exponent of the next term will be greater than ac/v-+1/v, since, for 
+, kh, a4. — =0(8), whence a;,, — a; 28> 1. It follows that 


(25) yi) —da 4+ 0Adef, A 
Similarly, replacing the ¢; by the ¢’;, we will have 
(26) (9, — =da+djas+--- 


We now consider the product II’(y¥,— y;). Here i assumes the values 
v/8’, 2v/8’,- , &v/8’, whence 


(27) — Yi = Y — = (ro 


(7 = 1, 2,° 


where w, is a primitive root of unity of exponent 8. Taking into account that 
(8, G41) = 8 > 1 and letting = sh, we find from (27), 


It follows, by (25), that 


i=1 


The first two terms of J] (¥: — y’:) are obtained from the above by replacing 
i=1 


to by t’9. Hence we must have 


Where €o) and e¢’o, are the constant terms in e and ¢’ respectively. These equa- 
tions imply the relations +5’ = t)" = ¢’,", in contradiction with the algebraic 
independence of the ¢; and the ?’;.. This proves our assertion. 


This result shows that the general element $t = \varepsilon\prod_{i=1}^{\nu}(y - y_i)$ of the
valuation ideal $\mathfrak{q}$ is exactly of the type considered in the proof of Theorem
11.1, and hence $\mathfrak{q}$ belongs to the class of ideals $\mathfrak{P}$ defined in that theorem.
This holds true for any valuation ideal $\mathfrak{q}$ whose general element is absolutely
irreducible, in particular for any simple valuation ideal. But it has been
shown that the ideals $\mathfrak{P}$ are simple ideals. Hence, a valuation ideal whose
general element is irreducible is simple. The converse has already been proved
before (Theorem 10.2, Corollary). Summing up and recalling Theorem 10.2,
we have the following theorem:


THEOREM 11.2. A valuation ideal $\mathfrak{P}$ is simple if and only if its general
element $t = \sum t_{ij}x^iy^j$ is absolutely irreducible. The general element $t$ of $\mathfrak{P}$ is
the direct product $t = \varepsilon \cdot \prod_{i=1}^{\nu}(y - y_i)$, where $\varepsilon$ is a unit (with indeterminate


k 

coefficients) and y, = > + = 1, the 
i=1 j=0 

$t_j$ being indeterminates. Given any valuation ideal $\mathfrak{A}$, the factorization of the
general element of $\mathfrak{A}$ into irreducible factors yields a factorization of $\mathfrak{A}$ into
simple v-ideals, according to the scheme indicated in Theorem 10.2.

We are now in position to prove the basic Theorem 5.2 of Part I:

COROLLARY 11.2 (Theorem 5.2). The transform of a simple v-ideal by
a quadratic transformation is a simple v-ideal.


Proof. It is immaterial whether the theorem is proved for polynomial
ideals or for power series ideals, in view of the relationship between these
ideals described in section 9. Let $\mathfrak{P}$ be a simple 0-dimensional v-ideal
in $\mathfrak{O}^* = \mathfrak{k}\{x, y\}$, belonging to a valuation $B^*$, and let $x'^{\nu}\mathfrak{P}'$ be the extended
ideal of $\mathfrak{P}$ in $\mathfrak{O}'^* = \mathfrak{k}\{x', y'\}$, where $x = x'$, $y = x'y'$ (we assume as usual that
$v(y) > v(x)$) and where $\nu$ is the integer such that $\mathfrak{P} \equiv 0(\mathfrak{p}^{*\nu})$, $\mathfrak{P} \not\equiv 0(\mathfrak{p}^{*\nu+1})$,
$\mathfrak{p}^* = (x, y)$. Here $\mathfrak{P}'$ is necessarily either zero-dimensional or the unit ideal.
Let $t(x, y) = \sum t_{ij}x^iy^j$ be the general element of $\mathfrak{P}$. By the definition of the
integer $\nu$, we must have $t_{ij} = 0$ for all $i, j$ such that $i + j < \nu$, and $t_{ij} \ne 0$
for some $i, j$ such that $i + j = \nu$. Hence $t(x, y) = t(x', x'y') = x'^{\nu}\tau(x', y')$,
where $\tau(x', y') = \sum \tau_{ij}x'^iy'^j$ is a formal power series in $x'$, $y'$. Here $\tau_{ij} = t_{i+\nu-j,\,j}$,
if $i + \nu \ge j$, and $\tau_{ij} = 0$, if $i + \nu < j$. In our special case the general
element $t$ of $\mathfrak{P}$ is of the form indicated in Theorem 11.2. Hence

$\tau = \varepsilon(x, y)\prod(y' - y'_i),$




where 


k oo 
i=1 j=0 
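If then $\varepsilon'(x', y')$ denote a general unit of $\mathfrak{O}'^*$ (i.e. a unit with indeterminate
coefficients), then the direct product $\varepsilon'\tau$ is the general element of a simple
ideal $\bar{\mathfrak{P}}'$. We have $v(\bar{\mathfrak{P}}') = v(\tau)$ and

$v(\tau) + v(x'^{\nu}) = v(t) = v(\mathfrak{P}),$

whence $v(\bar{\mathfrak{P}}') = v(\mathfrak{P}')$ and $\bar{\mathfrak{P}}' \equiv 0(\mathfrak{P}')$.
On the other hand, any element $t^0 = \sum t^0_{ij}x^iy^j$ of $\mathfrak{P}$ is obtained from the
general element $t$ by a specialization $t_{ij} \to t^0_{ij}$. Hence, if $t^0 = x'^{\nu}\tau^0$,
then $\tau^0$ is obtained from $\varepsilon'\tau$ by the specialization $\varepsilon' \to 1$, $\tau_{ij} \to \tau^0_{ij}$, and
therefore $\tau^0$ is an element of the ideal $\bar{\mathfrak{P}}'$. Since $x'^{\nu}\mathfrak{P}'$ is the extended ideal of
$\mathfrak{P}$, a finite number of elements such as $\tau^0$ form a base of $\mathfrak{P}'$. Hence $\mathfrak{P}' \equiv 0(\bar{\mathfrak{P}}')$,
and consequently $\mathfrak{P}' = \bar{\mathfrak{P}}'$, q. e. d.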
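
The substitution $x = x'$, $y = x'y'$ used in this proof acts on coefficients by a simple index shift: the monomial $x^iy^j$ becomes $x'^{\,i+j}y'^{\,j}$, and a common factor $x'^{\nu}$ splits off. The sketch below applies this to a polynomial with explicit coefficients. It is an illustration only; the dictionary representation and the function name are assumptions, and the input must be a non-zero polynomial.

```python
def quadratic_transform(t):
    """Given t = {(i, j): coeff} for t(x, y), return (nu, tau) such that
    t(x', x'*y') = x'**nu * tau(x', y'), with tau = {(i, j): coeff}.

    nu is the lowest total degree i + j occurring in t.
    """
    t = {k: c for k, c in t.items() if c != 0}
    nu = min(i + j for (i, j) in t)
    # x**i * y**j  ->  x'**(i + j) * y'**j, then divide by x'**nu.
    tau = {(i + j - nu, j): c for (i, j), c in t.items()}
    return nu, tau


# The cusp y^2 - x^3 has nu = 2 and transforms into y'^2 - x'.
nu, tau = quadratic_transform({(0, 2): 1, (3, 0): -1})
print(nu, tau)
```

Reading the result backwards gives the rule stated in the proof: the coefficient of $x'^iy'^j$ in $\tau$ is the coefficient of $x^{\,i+\nu-j}y^j$ in $t$, and it is zero when $i + \nu < j$.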


Remark 1. In view of the unicity of the factorization of a v-ideal $\mathfrak{A}$ into
simple v-ideals (Theorem 7.1), the second part of Theorem 11.2 implies that
the factorization of the general element of $\mathfrak{A}$ into irreducible factors yields
the factorization of $\mathfrak{A}$ into simple v-ideals.


Remark 2. Concerning the characterization of simple v-ideals given in
Theorem 11.2, it is not difficult to show that, conversely, an ideal $\mathfrak{P}$ whose
general element $t$ is of the type described in Theorem 11.2 is a v-ideal
(necessarily simple). For the proof we assume $\alpha_1 > \nu$ and we consider the transform
® of by the quadratic transformation 2 = 2, = = 
Using the notation of the proof above, we find as above the congruence 
#’=0(’). The preceding proof of the congruence Dp’ = 0(P’) was based 
upon the fact that P’ was a v-ideal. But we may proceed without making use 
of this property of P’. Let t’(2’, y’) be the general element of P’, = 3452/4’, 
and let = 0 be a true linear relation between the If = 
isany element in P, and if 4 = ar, = then is an ele- 


ment of P’. Hence we must have 




This relation holds for any element ¢ in , consequently we must have 
= 0, whence = 0. This last relation shows that r is a 
variable element of P’. But then, in view of Theorem 10.1, also ¢’r is a 
variable element of P’, since «’ is the general element of the unit ideal. Con- 
sequently P’ = 0(P’), whence again P’ = 9”. 
Consider the valuation B defined by the branch 


i=1 j=0 
where dy, - are arbitrary constants (in and d, ~0. Let the value of 
in this valuation be Ao/v, Ao—an integer. The integer Ay —A(P) is independent 
of the particular constants d; and is uniquely determined by the ideal ? and 
by the auxiliary condition «, = v which prevents a special position of the axis 
z=0(. If the variables 2’, y’ are used, then the valuation B is defined by 
the branch 
k 00 
i=1 j=0 

We have v(P’) = v(P) — v(2”), whence v(P’) = (A, — /v. If 
a%,—v=v, then A(P’) =rA,— <A(P). If then the roles of 
the variables 2’, y’ must be interchanged, and putting 4, —v—v’ we find 
v(P’) = [(A.—v*) /v] (v/v’), whence again A(P’) = Ay — v? < A(P). The 
inequality A(P’) < A(P) leads to a complete induction with respect to A(?). 
If A(P )—1, then v 1, since evidently A(P ) = v, and hence P = p* =(z,y), 
so that if A(P) —1, P is a v-ideal. Since A(P’) <A(P) we may assume, 
according to our induction, that P’ is a v-ideal. Now 2’’P’ is the extended 
ideal of , and from the expression of the general element ¢ of P it is seen 
that the subform of degree v of ¢ contains only the term y” (since a >»). 
By Theorem 4. 3, our assertion that P is a v-ideal, will follow, provided it is 
shown that P is the contracted ideal of x’*P’. The proof of this is immediate. 
The general element t of the contracted ideal of z’’P’ is of the form 2’”’7 (2’,y’); 
and we can assert that not only is 7 a variable element of P’ (this is obvious) 
but also that its coefficients 7;; satisfy those inequalities (see Theorem 11.1) 
which insure the irreducibility of +, because this is true for the general element 
t—2"r(2’,y’) of the ideal P. It follows that ¢ is necessarily of the form 
e(z, y) Il (y—yi), where y, => + whence is 

i=1 j=0 

variable element of . Since P is contained in the contracted ideal of 2’? 
it follows that P coincides with the contracted ideal, q. e. d. 




12. The class of complete ideals. It has been pointed out in section 7
that a product of valuation ideals is not always a valuation ideal. The class
of ideals (in $\mathfrak{k}[x, y]$ or in $\mathfrak{k}\{x, y\}$) which can be factored into valuation ideals
is therefore larger than the class of valuation ideals. We shall call ideals in
this class complete ideals. The term "complete" is suggested by the notion
of complete linear systems in algebraic geometry, these being the linear systems
which are defined uniquely by base conditions, i.e. by the condition of passing
with assigned multiplicities through an assigned set of proper or infinitely
near points. It will be seen that complete ideals are those and only those ideals
whose elements are subject to given base conditions, and to no other conditions.
In other words, the polynomials which belong to a complete ideal and whose
degree is not greater than a given integer $n$ form, for any $n$, a complete linear
system.

By our definition of complete ideals, the class of complete ideals is closed
under multiplication. Moreover, by Theorem 7.1, a complete ideal has a
unique factorization into simple complete ideals, these last ones being
necessarily valuation ideals. Our next aim is to prove that this class is also closed
under the other ideal operations $[\ ,\ ]$, $(\,:\,)$ (intersection and quotient); not
however under addition (+).* The theorem which we wish to prove is the
following:

THEOREM 12.1. If $\mathfrak{A}$ and $\mathfrak{B}$ are complete ideals and $\mathfrak{C}$ is an arbitrary
ideal, then $[\mathfrak{A}, \mathfrak{B}]$ and $\mathfrak{A} : \mathfrak{C}$ are complete ideals.

This theorem is an immediate consequence of the following:

LEMMA. Any complete ideal is the intersection of valuation ideals, and,
conversely, the intersection of valuation ideals is a complete ideal.


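Proof of the lemma. Let $\mathfrak{A} = [\mathfrak{A}_1, \mathfrak{A}_2, \dots, \mathfrak{A}_k]$, where each $\mathfrak{A}_j$ is a
valuation ideal for some valuation $B_j$. We may assume that the $\mathfrak{A}_j$ belong
to one and the same prime ideal $\mathfrak{p} = (x, y)$. Denote by $t$ the general
element of $\mathfrak{A}$ and let $t = t_1t_2 \cdots t_m$ be the factorization of $t$ into irreducible
factors in the ring $\mathfrak{K}\{x, y\}$, where $\mathfrak{K}$ is the algebraic closure of the field of
the coefficients of $t$. Let $\alpha_{ij} = v(t_i)$ be the value of $t_i$ in the valuation $B_j$
$(i = 1, 2, \dots, m;\ j = 1, 2, \dots, k)$, and let $\mathfrak{A}_{ij}$ be the v-ideal belonging to
$B_j$ such that $v(\mathfrak{A}_{ij}) = \alpha_{ij}$. Since $v(t_i) = v(\mathfrak{A}_{ij})$ in $B_j$, $t_i$ is a variable
element of $\mathfrak{A}_{ij}$, whence $t_i$ is also a variable element of the intersection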


*Example. The ideals (y?, p°), (#, y*) are v-ideals, but their sum (join) is the 
simple ideal (x, y?, p°), which is obviously not a v-ideal and consequently not complete. 


$\mathfrak{B}_i = [\mathfrak{A}_{i1}, \mathfrak{A}_{i2}, \dots, \mathfrak{A}_{ik}], \quad (i = 1, 2, \dots, m).$

Let $\tau_i$ be the general element of $\mathfrak{B}_i$. We have $v(\tau_i) \le v(t_i)$ in $B_j$, since all
the linear relations between the coefficients of $\tau_i$ are also satisfied by the
coefficients of $t_i$. On the other hand, we have in any of the valuations $B_j$,
$v(\tau_i) \ge v(\mathfrak{B}_i) \ge v(t_i)$. Hence $v(\tau_i) = v(t_i)$. Let $\tau = \tau_1\tau_2 \cdots \tau_m$
be the direct product of the elements $\tau_i$. The elements $t$ and $\tau$ have the same
value in each of the valuations $B_j$, whence $v(\tau) = v(\mathfrak{A}) \ge v(\mathfrak{A}_j)$ in $B_j$. This
shows that $\tau$ is a variable element of $\mathfrak{A}_j$, whence $\tau$ also is a variable element
of the intersection $\mathfrak{A}$ of the $\mathfrak{A}_j$. But $\tau$ is not less general than $t$ (since $\tau_i$ is
the general element of the ideal $\mathfrak{B}_i$, of which $t_i$ is a variable element, and since
$\tau$ is a direct product of the $\tau_i$); consequently, since $t$ is the general element
of $\mathfrak{A}$, also $\tau$ is the general element of $\mathfrak{A}$. We may then identify $t$ with $\tau$, and
we deduce that $t$ is the direct product of its irreducible factors $t_i$ and that
$t_i$ is the general element of $\mathfrak{B}_i$.

We now apply the considerations developed in the proof of Theorem 
11.2. For the irreducible element ¢; we have the following uniformization: 


$t_i = \varepsilon_i(x, y)\prod_{h=1}^{\nu_i}(y - y_h^{(i)})$, where


k 


j=0 


$\tau_{i0}$ transcendental with respect to the field $\mathfrak{k}$.


Since the inequalities $v(t) \ge v(\mathfrak{A}_j)$ in $B_j$, $j = 1, 2, \dots, k$, are the only
conditions which should be imposed on $t$, it follows that all the $\tau_{ij}$ are
indeterminates, as the value of $t_i$ in any algebraic valuation is not altered if we regard
the $\tau_{ij}$ as indeterminates. Also $\varepsilon_i(x, y)$ is to be regarded as a unit element
with indeterminate coefficients, and the product of $\varepsilon_i$ by $\prod_{h=1}^{\nu_i}(y - y_h^{(i)})$ is to be
understood as direct. But then we must have necessarily $(\alpha_{i1}, \alpha_{i2}, \dots, \nu_i) = 1$
and consequently $t_i$ is the general element of a simple v-ideal (by the
preceding section, Remark 2). Hence $\mathfrak{B}_i = \mathfrak{P}^{(i)}$, where $\mathfrak{P}^{(i)}$ is a simple v-ideal.
By Theorem 10.1, $t_1t_2 \cdots t_m$ is a quasi-general element of the ideal
$\mathfrak{P}^{(1)}\mathfrak{P}^{(2)} \cdots \mathfrak{P}^{(m)}$. But the coefficients of the direct product $t_1t_2 \cdots t_m$, i.e. of $t$,
satisfy only linear relations, since $t$ is the general element of $\mathfrak{A}$. Hence $t$ is also
the general element of $\mathfrak{P}^{(1)}\mathfrak{P}^{(2)} \cdots \mathfrak{P}^{(m)}$, and consequently $\mathfrak{A} = \mathfrak{P}^{(1)}\mathfrak{P}^{(2)} \cdots \mathfrak{P}^{(m)}$.
This proves the first part of the lemma.

To prove the second part of the lemma, let $\mathfrak{A} = \mathfrak{A}_1\mathfrak{A}_2 \cdots \mathfrak{A}_m$ be a
complete ideal, where $\mathfrak{A}_1, \mathfrak{A}_2, \dots, \mathfrak{A}_m$ are simple v-ideals, which we may assume
as primary ideals belonging to one and the same prime ideal $\mathfrak{p} = (x, y)$. Let
$\mathfrak{A}_j$ be of kind $h_j$, and let $h = \max(h_1, h_2, \dots, h_m)$. The assertion of the
lemma is trivial if $m = 1$. We assume that the assertion is true for all
integers $h'$ such that $h' < h$ (if $h = 1$, then necessarily $m = 1$). Let $\mathfrak{A}_j$
belong to a valuation $B_j$. Assuming that $v(y) \ge v(x)$ in each of the
valuations $B_j$, we apply the quadratic transformation $T$, $x' = x$, $y' = y/x$ to the
polynomial ring $\mathfrak{O} = \mathfrak{k}[x, y]$, getting the ring $\mathfrak{O}' = \mathfrak{k}[x', y']$ of polynomials in
$x'$, $y'$. Let $\mathfrak{O}'\mathfrak{A}_j = x'^{\rho_j}\mathfrak{A}'_j$, whence $\mathfrak{O}'\mathfrak{A} = x'^{\rho}\mathfrak{A}'_1\mathfrak{A}'_2 \cdots \mathfrak{A}'_m$, $\rho = \sum_{j=1}^{m}\rho_j$, where
$\mathfrak{A}_j \equiv 0(\mathfrak{p}^{\rho_j})$ and $\mathfrak{A}_j \not\equiv 0(\mathfrak{p}^{\rho_j+1})$. By Theorem 4.1, $\mathfrak{A}'_j$ is a v-ideal in $\mathfrak{O}'$, and
by Theorems 5.2 and 5.3 $\mathfrak{A}'_j$ is a simple v-ideal of kind $h_j - 1$. The product
$\prod_{j=1}^{m}\mathfrak{A}'_j$ can be first written as the intersection of the partial products consisting
of factors $\mathfrak{A}'_j$ which belong to one and the same prime ideal. Then, by our
induction, each partial product can be written as the intersection of v-ideals.

Let then 


whence 
where B’,, B’,,- - -, are v-ideals in ©’. 


Let o; be the smallest integer such that 2’7'Q’; is an extended ideal of an 
ideal in D, and let B; be the contracted ideal of 2’B’;. By Theorem 4. 3, Bi 
isa v-ideal and = 7-1(B’;). We have = 2B’; and O/A = A’. 
We assert that oi =p. In fact, assuming that 0; > p, we have 


whence, passing to the contracted ideal of =0(B,). Now 
also B; is in and not in the congruence = 0(B,;) implies that 
the subforms of degree o; of the polynomials in %; form a linear system 2(B;) 


of dimension = oi—p > 0. This is impossible since, by our definition of 
¥i(— T*(B’;)), the system 0(%;) is of dimension zero. 

We have therefore o; Sp, and consequently 7°B’,; is an extended ideal, 
its contracted ideal being p?-7'B, (Theorem 4.3). Denoting by %* the con- 


tracted ideal of O/H, we obtain from (28): 


Now B; is a valuation ideal for some valuation, and in that valuation the 




ideal p’-°'B, is equivalent to some valuation ideal ©. By Corollary #.% we 
have p?-7'B, = p?], whence 


— [C,,6.,-- -, Cp’). 


Hence Y%* is the intersection of valuation ideals.1* By the first part of the 
lemma, just proved, 9%{* is a complete ideal, 


where the %*; are simple v-ideals. %& and %* have the same extended ideal 
If then T(M*;) W’*; we must have 


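By the unique factorization theorem (7.1), it follows, for a proper ordering
of the factors, $m = n$, $\mathfrak{A}'_j = \mathfrak{A}'^*_j$, whence also $\mathfrak{A}_j = \mathfrak{A}^*_j$, $\mathfrak{A} = \mathfrak{A}^*$, and
consequently $\mathfrak{A}$ is the intersection of v-ideals, q. e. d.

Theorem 12.1 now follows immediately. That $[\mathfrak{A}, \mathfrak{B}]$ is a complete ideal,
if $\mathfrak{A}$ and $\mathfrak{B}$ are complete ideals, is now trivial, since, by our lemma, complete
ideals can also be defined as intersections of valuation ideals. Consider now
the ideal $\mathfrak{A} : \mathfrak{C}$, where $\mathfrak{A}$ is a complete ideal and $\mathfrak{C}$ is an arbitrary ideal. We
have $\mathfrak{A} = [\mathfrak{A}_1, \mathfrak{A}_2, \dots, \mathfrak{A}_k]$, where the $\mathfrak{A}_j$ are valuation ideals, whence
$\mathfrak{A} : \mathfrak{C} = [\mathfrak{A}_1 : \mathfrak{C}, \mathfrak{A}_2 : \mathfrak{C}, \dots, \mathfrak{A}_k : \mathfrak{C}]$. But by Theorem 2.1, the quotient $\mathfrak{A}_j : \mathfrak{C}$
is again a valuation ideal. Hence $\mathfrak{A} : \mathfrak{C}$ is the intersection of valuation ideals,
and therefore is a complete ideal, q. e. d.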


COROLLARY 12.2. If 𝔄 is a complete ideal (in 𝔣[x, y] or in 𝔣{x, y}) and
if x'^ρ𝔄' is the extended ideal 𝔬'𝔄 in the ring 𝔣[x', y'] or 𝔣{x', y'}, where
x' = x, y' = y/x, then 𝔄' is a complete ideal and 𝔄 is the contracted ideal
of x'^ρ𝔄'.


That 𝔄' is a complete ideal is trivial. The second part of the corollary
follows from the relation 𝔄 = 𝔄* established in the course of the proof of the
second part of the lemma.


13 Any power of 𝔭 is a valuation ideal. Consider the 0-dimensional valuation
defined by the divisor (x') and by the point y' = 0 of this divisor, i.e. if F(x', y') is
any rational function of x', y', then put v(F) = (m, n), where x'^m is the highest power
of x' which divides F, F = x'^m F_1(x', y'), F_1(0, y') ≠ 0, and where y'^n is the highest
power of y' which divides F_1(0, y'). Then it is easily seen that the sequence of v-ideals




POLYNOMIAL IDEALS. 197 


When a linear system of curves f = 0 is subjected to base conditions,
it is required that the curves of the system have assigned intersection
multiplicities with an assigned set of algebraic branches γ_i. For each branch γ_i
the corresponding condition is equivalent to the condition that the value of the
polynomial f in the valuation B_i defined by the branch γ_i be not inferior to a
given integer; in other words: it is required that f ≡ 0(𝔄_i), where 𝔄_i is a
given valuation ideal belonging to B_i. The full set of base conditions is then
described by the congruence f ≡ 0([𝔄_1, 𝔄_2, ···]), i.e. by the condition that
f belong to a given complete ideal. However, the representation of a complete
ideal as an intersection of valuation ideals is not unique. For instance, let
𝔄 = (xy, 𝔭³), where 𝔭 = (x, y). 𝔄 is not a valuation ideal, since it has only
one subform xy and this is not a power of a linear form. Let 𝔄_1 = (x², xy, 𝔭³),
𝔄_2 = (xy, y², 𝔭³), 𝔅_1 = (x, 𝔭³), 𝔅_2 = (y, 𝔭³). These 4 ideals
are valuation ideals and we have 𝔄 = [𝔄_1, 𝔄_2] = [𝔅_1, 𝔅_2]. This ambiguity
is the algebraic equivalent of the distinction which the geometric theory makes
between the assigned or virtual multiplicities of the curves of a linear system
and the effective multiplicities of these curves. It is the representation of a
complete ideal as a product of valuation ideals that is unique and puts into
evidence the effective multiplicities of the general curve of a linear system.*
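
The non-uniqueness just described can be checked mechanically for monomial ideals. The following is only a sketch under stated assumptions: monomials x^a y^b are encoded as exponent pairs (a, b), and the four valuation ideals are taken to be (x², xy, 𝔭³), (xy, y², 𝔭³), (x, 𝔭³), (y, 𝔭³), which is one choice consistent with the statement above. The helper names `contains` and `is_intersection` are hypothetical.

```python
from itertools import product

# Generators are exponent pairs (a, b) standing for the monomial x^a y^b.
P3 = [(3, 0), (2, 1), (1, 2), (0, 3)]   # p^3 with p = (x, y)
A = [(1, 1)] + P3                        # (xy, p^3)
# Assumed choice of the four valuation ideals (an assumption of this sketch):
A1 = [(2, 0), (1, 1)] + P3               # (x^2, xy, p^3)
A2 = [(1, 1), (0, 2)] + P3               # (xy, y^2, p^3)
B1 = [(1, 0)] + P3                       # (x, p^3)
B2 = [(0, 1)] + P3                       # (y, p^3)


def contains(gens, monomial):
    """x^a y^b lies in a monomial ideal iff some generator divides it."""
    a, b = monomial
    return any(a >= g[0] and b >= g[1] for g in gens)


def is_intersection(target, left, right, bound=4):
    """Compare target with the intersection of left and right, monomial by monomial.

    Every ideal used here contains p^3, so exponents up to `bound` = 4
    already cover all monomials on which the ideals could differ.
    """
    box = product(range(bound + 1), repeat=2)
    return all(contains(target, m) == (contains(left, m) and contains(right, m))
               for m in box)


print(is_intersection(A, A1, A2), is_intersection(A, B1, B2))
```

Both comparisons succeed, so under these assumptions the same ideal (xy, 𝔭³) arises from two different pairs of valuation ideals.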

We point out explicitly the following result arrived at in the course of the
first part of the above proof: the general element t of a complete ideal 𝔄 is
the direct product of its irreducible factors, and the factorization of t yields
the factorization of 𝔄 into simple v-ideals. This is the generalization to complete
ideals of the similar property of valuation ideals given in Theorem 11.2.

We conclude this section with the definition of an operation which assigns
to each ideal 𝔄 in 𝔬 a uniquely determined complete ideal 𝔄', the complete
ideal determined by 𝔄. The analogue of this operation in the theory of linear
systems is given by the passage from an arbitrary linear system to the
corresponding complete linear system. We define 𝔄' as the intersection of all the
complete ideals containing 𝔄. Since 𝔄 has a finite length, it is clear that 𝔄'
is also the intersection of a finite number of complete ideals, hence 𝔄' itself is
a complete ideal, i.e. 𝔄' is the smallest complete ideal containing 𝔄. The
operation comes under the heading of the (') operations studied in other
connections by van der Waerden ([9], § 103), Prüfer [8] and others, since it enjoys
the following formal properties:


in 𝔬 consists of all ideals of the form (x^λ y^{ρ-λ}, x^{λ-1} y^{ρ-λ+1}, ···, y^ρ, 𝔭^{ρ+1}),
ρ ≥ λ ≥ 0, ρ = 1, 2, 3, ···. For λ = ρ, we find the ideal 𝔭^ρ.


*See Note at end of Section 12. 




3. (𝔄'_1, 𝔄'_2)' = (𝔄_1, 𝔄_2)'.     4. (𝔄'_1𝔄'_2)' = (𝔄_1𝔄_2)'.
5. (a)' = (a); ((a)·𝔄)' = (a)·𝔄'.


Proof. 1. Trivial, because 𝔄' is a complete ideal itself;

2. Self-evident;

3. By 2, (𝔄_1, 𝔄_2)' ⊇ (𝔄'_1, 𝔄'_2), whence (𝔄_1, 𝔄_2)' ⊇ (𝔄'_1, 𝔄'_2)'.
On the other hand (𝔄'_1, 𝔄'_2) ⊇ (𝔄_1, 𝔄_2), whence (𝔄'_1, 𝔄'_2)' ⊇ (𝔄_1, 𝔄_2)', by 2.

4. By 2, we have (𝔄'_1𝔄'_2)' ⊇ (𝔄_1𝔄_2)'. Let 𝔅 be any valuation
ideal belonging to some valuation B and containing 𝔄_1𝔄_2, and let 𝔅_i,
i = 1, 2, be the valuation ideal for B which is equivalent to 𝔄_i. We have
v(𝔅_1𝔅_2) = v(𝔄_1𝔄_2) ≥ v(𝔅), whence 𝔅_1𝔅_2 ≡ 0(𝔅). Since 𝔄_i ≡ 0(𝔅_i) and
𝔅_i is a valuation ideal, hence complete, we have 𝔄'_i ≡ 0(𝔅_i), consequently
𝔄'_1𝔄'_2 ≡ 0(𝔅_1𝔅_2). Since every complete ideal is the intersection of valuation
ideals, it follows that (𝔄_1𝔄_2)' is the intersection of all the valuation ideals 𝔅
containing 𝔄_1𝔄_2. Hence 𝔄'_1𝔄'_2 ≡ 0((𝔄_1𝔄_2)'). As a consequence also
(𝔄'_1𝔄'_2)' ⊆ (𝔄_1𝔄_2)', whence 4 follows.


We observe, however, that (𝔄'_1𝔄'_2)' = 𝔄'_1𝔄'_2, since the product of the
complete ideals 𝔄'_1, 𝔄'_2 is itself complete. Hence 4 can be written as follows:


4'. 𝔄'_1𝔄'_2 = (𝔄_1𝔄_2)'.


5. A principal ideal (a) is itself a complete ideal. If a = a_1^{ρ_1} a_2^{ρ_2} ··· a_h^{ρ_h},
where a_i(x, y) is an irreducible polynomial, then (a) = (a_1^{ρ_1}) ∩ (a_2^{ρ_2}) ∩ ··· ∩ (a_h^{ρ_h}),
and evidently (a_i^{ρ_i}) is a v-ideal, belonging to the 1-dimensional valuation
defined by the divisor (a_i). Hence (a)' = (a). In a similar straightforward
manner the relation ((a)·𝔄)' = (a)·𝔄' can be proved.
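
For monomial ideals the operation (') can be experimented with directly. The sketch below rests on an assumption that is not stated in the text: that the complete ideal determined by a monomial ideal is spanned by the monomials x^a y^b whose exponent (a, b) dominates a point of the convex hull of the generators' exponents (the usual description of the integral closure of a monomial ideal). All function names are hypothetical, and ideals are compared only on the monomials with exponents up to a fixed bound.

```python
from fractions import Fraction
from itertools import product


def dominates_segment(m, p, q):
    """True if m >= t*p + (1 - t)*q componentwise for some t in [0, 1]."""
    lo, hi = Fraction(0), Fraction(1)
    for k in range(2):
        slope, rhs = p[k] - q[k], m[k] - q[k]   # constraint: t * slope <= rhs
        if slope == 0:
            if rhs < 0:
                return False
        elif slope > 0:
            hi = min(hi, Fraction(rhs, slope))
        else:
            lo = max(lo, Fraction(rhs, slope))
    return lo <= hi


def completion(gens, bound):
    """Monomials (exponents <= bound) of the assumed completion of (gens)."""
    gens = list(gens)
    return {m for m in product(range(bound + 1), repeat=2)
            if any(dominates_segment(m, p, q) for p in gens for q in gens)}


def ideal_product(g1, g2):
    """Generators of the product of two monomial ideals."""
    return {(a + c, b + d) for (a, b) in g1 for (c, d) in g2}


def members(gens, bound):
    """Monomials (exponents <= bound) lying in the monomial ideal (gens)."""
    return {m for m in product(range(bound + 1), repeat=2)
            if any(m[0] >= g[0] and m[1] >= g[1] for g in gens)}


A = [(2, 0), (0, 2)]          # (x^2, y^2)
B = [(3, 0), (0, 3)]          # (x^3, y^3)
A_c = completion(A, 6)        # expected to be p^2, i.e. to contain xy
print((1, 1) in A_c, (1, 0) in A_c)
```

With these data one can confirm on the truncated monomial sets that the completion of (x², y²) is 𝔭², that completing twice changes nothing (property 1), and that the product of the completions of A and B agrees with the completion of the product (property 4').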


THEOREM 12.3 (invariance of the operation (') under quadratic
transformations). If 𝔭 = (x, y) and if 𝔄 ≡ 0(𝔭^ρ), 𝔄 ≢ 0(𝔭^{ρ+1}), then also
𝔄' ≡ 0(𝔭^ρ), 𝔄' ≢ 0(𝔭^{ρ+1}). Moreover, if 𝔅 is the transform of 𝔄 under the
quadratic transformation T, then 𝔅' is the transform of 𝔄' under T, i.e.


𝔬'𝔄 = x'^ρ𝔅 implies 𝔬'𝔄' = x'^ρ𝔅'.


Proof. Since any power of 𝔭 is a complete ideal (even a valuation ideal)
and since 𝔄' is the smallest complete ideal containing 𝔄, the congruence
𝔄 ≡ 0(𝔭^ρ) implies the congruence 𝔄' ≡ 0(𝔭^ρ). Moreover, if 𝔄 ≢ 0(𝔭^{ρ+1}),
also 𝔄' ≢ 0(𝔭^{ρ+1}), since 𝔄 ≡ 0(𝔄'). Let then 𝔬'𝔄 = x'^ρ𝔅, 𝔬'𝔄' = x'^ρ𝔅*,
where 𝔬' = 𝔣[x', y'] (or 𝔬' = 𝔣{x', y'}), x' = x, y' = y/x, and where 𝔅 and 𝔅*
are not divisible by x'. Since 𝔄 ≡ 0(𝔄'), also 𝔅 ≡ 0(𝔅*), and by Corollary
12.2, 𝔅* is a complete ideal. To prove that 𝔅* = 𝔅', we have to show that
if 𝔅_1 is any complete ideal containing 𝔅, then 𝔅* ≡ 0(𝔅_1). Let σ be the
smallest integer such that x'^σ𝔅_1 is an extended ideal of an ideal in 𝔬. The
same reasoning employed in the proof of the second part of the Lemma, for
the derivation of the inequality σ_i ≤ ρ, can be used also now in order to show
that σ ≤ ρ. It is only necessary to observe that by Corollary 12.2 the contracted
ideal of x'^σ𝔅_1 is a complete ideal, say ℭ. The system Ω(ℭ) of the
subforms (of degree σ) of ℭ can be of dimension greater than zero, only if in
the factorization of ℭ into simple v-ideals there occurs the factor 𝔭. We would
have then ℭ = 𝔭𝔇, whence 𝔬'𝔇 = x'^{σ-1}𝔅_1, in contradiction with the definition
of the integer σ. Hence Ω(ℭ) is of dimension zero, and it was this value of
the dimension of Ω(𝔅_i) that played a rôle in the proof of the Lemma.

Since σ ≤ ρ and 𝔅_1 ⊇ 𝔅, we have x'^ρ𝔅_1 ⊇ x'^ρ𝔅. Now let ℭ be the contracted
ideal of x'^σ𝔅_1. ℭ is a complete ideal, and its extension ideal is x'^σ𝔅_1.
Hence the extended ideal of 𝔭^{ρ-σ}ℭ is x'^ρ𝔅_1, and consequently (Corollary 12.2)
𝔭^{ρ-σ}ℭ is the contracted ideal of x'^ρ𝔅_1, since 𝔭^{ρ-σ}ℭ is also a complete ideal. Since
the contracted ideal of x'^ρ𝔅 contains 𝔄, the congruence x'^ρ𝔅 ≡ 0(x'^ρ𝔅_1) implies
the congruence 𝔭^{ρ-σ}ℭ ⊇ 𝔄. Hence 𝔭^{ρ-σ}ℭ ⊇ 𝔄', and passing to the extended
ideals in 𝔬', we find x'^ρ𝔅_1 ⊇ x'^ρ𝔅*, i.e. 𝔅_1 ⊇ 𝔅*, q. e. d.


Note. For those not familiar with the geometric terminology we give here the
definition of the effective multiplicities on the basis of the present treatment. We
associate with each simple v-ideal 𝔓_{k+1}, of kind k + 1, belonging to the prime ideal
𝔓_1 = 𝔭 = (x, y), a point O_{k+1} in the k-th neighborhood of the point O_1(0, 0) of the
(x, y)-plane. Let 𝔓_1, 𝔓_2, ···, 𝔓_k be the simple v-ideals of kind 1, 2, ···, k
determined by 𝔓_{k+1} and preceding it (Theorem 6.1), and let O_1, O_2, ···, O_k be the
associated points. We proceed to define the set of base points of the ideal 𝔓_{k+1},
or the symbol

B(𝔓_{k+1}) = (O_1^{r_1} O_2^{r_2} ··· O_{k+1}^{r_{k+1}}),     r_i > 0,


and we shall say that r_i is the effective multiplicity of the ideal 𝔓_{k+1} at
O_i (r_{k+1} = 1). We set B(𝔓_1) = (O_1). For any k we define B(𝔓_{k+1}) by induction
with respect to k. We know, by Theorem 6.2, that if B is any valuation for which
𝔓_{k+1} is a v-ideal, the v-ideals for B which precede 𝔓_{k+1} are independent of B. Let
𝔮_k be the v-ideal which is followed immediately by 𝔓_{k+1}, and let
𝔮_k = 𝔓_1^{α_1} 𝔓_2^{α_2} ··· 𝔓_k^{α_k}.
Assuming that the symbol B(𝔓_i) has already been defined for all i < k + 1, we put


where, if 
B( Pixs) = (0,%10,%- - 




and 
== (0,40,%- - - 0;410;.1), 


then 
B(P;)B(P;) == - 


To check this definition against the customary geometric definition, we point out two 
implications of our definition: 


(1) ts exactly divisible by i.e. =O(p"), The 
assertion is true for k —0. Assuming that is true for P;,,, i —0,1,---,k—1 
and that P;,, is exactly divisible by p"%, then 


and from this we conclude, in view of 𝔮_k = 𝔓_1^{α_1} 𝔓_2^{α_2} ··· 𝔓_k^{α_k}, that 𝔮_k is exactly
divisible by 𝔭^{r_1}. Now 𝔭𝔮_k ≡ 0(𝔓_{k+1}), since 𝔓_{k+1} is a maximal subideal of 𝔮_k.
Hence 𝔓_{k+1} is divisible at most by 𝔭^{r_1+1}. If 𝔓_{k+1} was divisible by 𝔭^{r_1+1}, then the
system of subforms Ω(𝔭𝔮_k) would have been a subsystem of Ω(𝔓_{k+1}). This is
impossible, since Ω(𝔭𝔮_k) is of dimension ≥ 1, while Ω(𝔓_{k+1}) is of dimension 0.
Hence 𝔓_{k+1} is divisible exactly by 𝔭^{r_1}. We have thus proved that in the general
polynomial f(x, y) in 𝔓_{k+1} the terms of lowest degree are of degree r_1, i.e. the
curve f = 0 has at O_1 an r_1-fold point (while no curve f = 0, f ∈ 𝔓_{k+1}, has at O_1
a multiplicity less than r_1).


(2) If P’; is the transform of P;4,, by a quadratic transformation T having 
at 0, a fundamental point, then B(P’,) = - where 0; is the 
point associated with P’;. To prove this, we first observe that from the fact, just 
proved above, that q, and Pz,, are both divisible exactly by the same power, p", 
of p, it follows that Q(Pz,,) is a subsystem of Q(q,). Since Pz,, is a maximal 
subideal of gq), the dimension of (q,) cannot exceed the dimension of Q(P,,:) 
augmented by 1. But Q(Px,,) is of dimension 0, while Q(q,) is of dimension 4. 
Hence a, 1. Let T(q,) where We 
assert that q’, is the immediate predecessor of P’;. If a, = 0, the assertion follows 
immediately from Theorem 4. 4, since we have in this case T* (qn) —=qy. Let now 
a, 1. Assume that q’, is not the immediate predecessor of P’, and let 9’, bea 
v-ideal between q’, and Let qm = Px, Qr = 
whence qm — Gr Pir (by Theorem 4.4). We have q,=0(q») == 0(p"?), and 
also since ==0(q,). We also have pom q,=0(q,), since 
9, is the immediate predecessor of P;,,;. If q, was divisible by p", then Q(q) 
would be a subsystem of (q,), and this is impossible, since Q(q,) is of dimension 
1, while 2(q,) must be of dimension 0. Hence q, is divisible exactly by p"?. Now 
we have v(q,) > (par) > V(PGm)— v(qy), and consequently por = 0( Pasi): 
This is a contradiction, since both pq, and P;z,, are divisible exactly by p” and 
Q(pq,) is of dimension 1. It is thus proved that q’, is the immediate predecessor 


of P’,. As a consequence we have 


From this relation our statement follows immediately by induction with respect to I. 
Our result implies that if the general curve f=0, fe Pr, passes through 




the quadratic transformation 'T passes through the points 0’,,0’.,- + -,0’%, with 
effective multiplicities 1. The identity between our definition of 
effective multiplicities and the customary geometric definition is thus fully proved. 
It is hardly necessary to add that the definition of the symbol B(𝔄) for any complete
ideal 𝔄 amounts to postulating the relation B(𝔅ℭ) = B(𝔅)B(ℭ).


13. Simple v-ideals and divisors of the second kind. Let 𝔅 be a
divisor of the field Σ = 𝔣(x, y), i.e. an homomorphism of Σ upon a field Σ' of
dimension 1 (and the symbol ∞), and let us assume that 𝔅 is of second kind
for the ring 𝔬 = 𝔣[x, y], i.e. that the prime ideal 𝔭 determined in 𝔬 by
the divisor 𝔅 is zero-dimensional, say 𝔭 = (x, y) (we exclude the case in which
x or y are mapped upon ∞). The points of the Riemann surface of the field
Σ' define a set of valuations {B_α} of the field Σ, all of rank 2. Let {𝔮_i^{(α)}} be
the Jordan sequence of v-ideals in 𝔬 belonging to the valuation B_α. The ideal
𝔮_1^{(α)} = 𝔭 is independent of α. There may be other values of i such that 𝔮_i^{(α)}
is independent of α, and, in particular, there may occur in the sequence {𝔮_i^{(α)}}
simple ideals independent of α. Their number is necessarily finite, because
the simple v-ideals in the sequence {𝔮_i^{(α)}} determine the sequence completely
(Theorem 6.2) and since the valuations B_α are all distinct. Let 𝔓_ρ be the
last simple v-ideal, of kind ρ, which occurs in all the sequences {𝔮_i^{(α)}}. The
simple ideal 𝔓_{ρ+1}^{(α)}, of kind ρ + 1, will then vary with α. We have thus
associated with every divisor of Σ, of the second kind, with respect to 𝔬, a simple
v-ideal 𝔓_ρ in 𝔬. If we apply ρ successive quadratic transformations, getting
a polynomial ring 𝔣[X, Y] of the new indeterminates X, Y, the ideal 𝔓_{ρ+1}^{(α)}
is transformed into a prime 0-dimensional ideal 𝔭^{(α)} = (X, Y − c^{(α)}), and the
constant c^{(α)} must vary as α varies. As a consequence, the divisor 𝔅 is of the
first kind with respect to the ring 𝔣[X, Y], and the corresponding prime ideal
in this ring is necessarily the 1-dimensional ideal (X). This shows that there
exists a divisor 𝔅 for any preassigned simple v-ideal 𝔓_ρ and that 𝔅 is uniquely
determined by 𝔓_ρ. We have then a one to one correspondence between the
divisors of the second kind (with respect to the ring 𝔣[x, y]) and the 0-dimensional
simple v-ideals in 𝔣[x, y]. The field Σ' upon which Σ is mapped by 𝔅
coincides with the field 𝔣(Y) and is therefore purely transcendental.

It is important to point out that as the valuation B_α runs through the set
{B_α} determined by the points of the field Σ', the set {𝔓_{ρ+1}^{(α)}} will include
all the simple v-ideals 𝔓_{ρ+1}, of kind ρ + 1, such that 𝔓_{ρ+1} and 𝔓_ρ belong
together to one and the same valuation. This follows from the fact that all
such ideals 𝔓_{ρ+1} are transformed by ρ successive quadratic transformations
into the above ideals 𝔭^{(α)}.

The preceding considerations refer to the field 𝔣(x, y) of rational functions
in x, y, but can be immediately extended to the field Σ* of meromorphic
functions in x, y. If we put x = x', y = x'y', every holomorphic function
f(x, y) assumes the form x'^ρ(f_1(y') + x'f_2(y') + ···). If φ is another
holomorphic function of x, y, and φ = x'^σ(φ_1(y') + x'φ_2(y') + ···), we map the
function f/φ upon 0, ∞ or f_1(y')/φ_1(y'), according as ρ > σ, ρ < σ or ρ = σ.
This mapping defines an homomorphism of the field Σ* upon the purely
transcendental field 𝔣(y'). We regard this homomorphism as a divisor of the second
kind of Σ*, since the prime ideal in 𝔣{x, y} determined by this divisor is the
0-dimensional ideal 𝔭* = (x, y). We associate this divisor with the simple
ideal 𝔭*. In the same manner we may associate with any simple ideal 𝔓_{h+1}
of kind h + 1 in 𝔣{x, y} a divisor of Σ*, in which Σ* is mapped upon a purely
transcendental field of one variable. We do this by first applying h successive
quadratic transformations, getting a ring 𝔬*_h = 𝔣{x_h, y_h}, which contains
𝔣{x, y} and in which the ideal 𝔓_{h+1} corresponds to a simple v-ideal of kind 1,
i.e. to the ideal (x_h, y_h). This ideal defines a divisor of the field Σ*_h of
meromorphic functions of x_h, y_h, mapping Σ*_h upon the field 𝔣(y'_h) of rational
functions of y'_h (= y_h/x_h). In this homomorphism the subfield Σ* of Σ*_h is
mapped upon the entire field 𝔣(y'_h), since x_h, y_h are rational functions of x and
y. We associate the divisor of Σ*, obtained in this manner, with the simple
v-ideal 𝔓_{h+1}. In exactly the same manner as for the field of rational functions
of x and y, it is shown that the correspondence between all the divisors of Σ*,
defined by homomorphic mappings of Σ* upon fields of dimension one with
respect to 𝔣 and of second kind with respect to 𝔣{x, y}, and the simple v-ideals
in 𝔣{x, y} is (1, 1), and that consequently any such divisor is purely
transcendental (i.e. the field upon which Σ* is mapped by a divisor of the second
kind is necessarily purely transcendental).
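
The mapping of f/φ defined by the substitution x = x', y = x'y' can be made concrete for polynomials (regarded as truncated holomorphic functions). This is a minimal sketch under assumptions of our own: a polynomial is stored as a dictionary {(i, j): c} for the sum of the terms c·x^i·y^j, and the names `leading_form` and `divisor_image` are hypothetical. Since x^i y^j becomes x'^{i+j} y'^j, the exponent ρ is the lowest total degree and f_1(y') collects the terms of that degree.

```python
def leading_form(poly):
    """Return (rho, f1) for poly = {(i, j): c}, after x = x', y = x'y'.

    rho is the lowest total degree i + j; f1 = {j: c} encodes the
    polynomial f_1(y') = sum of c * y'^j over the terms of degree rho.
    """
    terms = {m: c for m, c in poly.items() if c != 0}
    rho = min(i + j for (i, j) in terms)
    f1 = {j: c for (i, j), c in terms.items() if i + j == rho}
    return rho, f1


def divisor_image(f, phi):
    """Image of f/phi: "0", "infinity", or the pair (f1, phi1) for f1/phi1."""
    rho, f1 = leading_form(f)
    sigma, phi1 = leading_form(phi)
    if rho > sigma:
        return "0"
    if rho < sigma:
        return "infinity"
    return f1, phi1


# f = y^2 - x^3 and phi = x*y both have lowest degree 2,
# so f/phi is mapped upon y'^2 / y' = y'.
print(divisor_image({(0, 2): 1, (3, 0): -1}, {(1, 1): 1}))
```

In this example the image is the non-constant rational function y', while a quotient such as x²/y, whose numerator has the higher lowest degree, is mapped upon 0.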

We point out explicitly, that if a divisor of &* of the first kind with 
respect to €{x, y} is defined as an homomorphic mapping of =* upon a field 9, 
such that the prime ideal determined in f{z, y} by the divisor is one-dimensional, 
then © is necessarily the field of all meromorphic functions of one indetermi- 
nate. Thus for fields of meromorphic functions &* the classification of the 
divisors into two kinds is not merely a relative classification with respect to 
the ring f{x, y}, but rather a classification in terms of the properties of the 
field 3* itself. That this should be so is only natural, in view of the privileged 
role which the ring of holomorphic functions plays in the field &*. 

We conclude with one final remark, which finds an application in the
proof of the well-known algebro-geometric theorem, that a pencil of curves
on an algebraic surface is necessarily linear, if the pencil has a base point at a
simple point of the surface. This remark will be elaborated in a joint note




by Dr. O. Schilling and the present author. At this place we wish only to
observe that the proof of this theorem is based upon the following assertion:

Given a meromorphic function f(x, y)/φ(x, y), and assuming that the
elements f and φ of 𝔣{x, y} are relatively prime and that neither is a unit, then
there exists a divisor 𝔅 of second kind of the field Σ* of meromorphic functions,
such that f/φ is mapped upon a transcendental element of the image field.
This assertion can be readily proved as follows. Consider the ideal 𝔄 = (f, φ)
in 𝔣{x, y}. This ideal is zero-dimensional, since f and φ are relatively prime
and are both contained in the ideal (x, y). Let 𝔄' be the complete ideal
determined by 𝔄. Let us first consider the case in which 𝔄' = 𝔭* = (x, y).
In this case we assert that the required divisor 𝔅 is the one associated with 𝔭*.
In fact, assume that f/φ is mapped by 𝔅 upon a constant c. We may assume
c = 0, replacing f − cφ by f. Under this hypothesis f/φ will have positive
value in all the zero-dimensional valuations B_α defined by the points of the
Riemann surface of 𝔅, and hence, since φ ≡ 0(𝔭*), f must belong, for any α,
to the valuation ideal 𝔮_2^{(α)} which follows 𝔭* in the Jordan sequence of v-ideals
belonging to B_α. As α varies, 𝔮_2^{(α)} can be any maximal subideal of 𝔭* (see
section 8) and is a simple ideal (of kind 2). Now for some α we will have
φ ≡ 0(𝔮_2^{(α)}). In fact, it is sufficient to consider the valuation defined by an
irreducible branch of the analytical locus φ = 0. For this valuation it is true
that the element φ is contained in all the 0-dimensional v-ideals 𝔮_i belonging
to the valuation, whence also in 𝔮_2. It follows then that for some α, both
f and φ are contained in 𝔮_2^{(α)}. But this is impossible, since 𝔮_2^{(α)} is a complete
ideal which does not contain the complete ideal 𝔄' = 𝔭* determined by (f, φ).

In the general case, let 𝔄' = 𝔓_1^{α_1} 𝔓_2^{α_2} ··· 𝔓_m^{α_m}, where 𝔓_i is a simple
v-ideal of kind h_i. Let h = max{h_i}, and let us apply the quadratic
transformation x' = x, y' = y/x. If 𝔄 ≡ 0(𝔭*^ρ), 𝔄 ≢ 0(𝔭*^{ρ+1}), then by Theorem
12.3, also 𝔄' ≡ 0(𝔭*^ρ), 𝔄' ≢ 0(𝔭*^{ρ+1}), and putting f = x'^ρ f_1, φ = x'^ρ φ_1,
𝔄_1 = (f_1, φ_1), the complete ideal 𝔄'_1 determined by 𝔄_1 is the transform T(𝔄'),


1, the ideal D’ a Pp’ 


Here each 𝔓'_i is of kind h_i − 1, so that max{h_i − 1} = h − 1. Since the
theorem has already been proved for h = 1 (𝔄' = 𝔭*), we may assume that
the theorem is true for h − 1. We may even assume that for the function f_1/φ_1
the divisor whose existence is stated in our assertion is the divisor associated
with a factor 𝔓'_i such that 𝔓'_i is of maximum kind h − 1. Then it follows
immediately that the divisor of the field Σ* associated with the factor 𝔓_i
satisfies the assertion.


THE JOHNS HOPKINS UNIVERSITY.


REFERENCES.


1. Enriques, F. and Chisini, O., Lezioni sulla teoria geometrica delle equazioni e delle funzioni algebriche, vol. 2.

2. Krull, W., "Idealtheorie," Ergebnisse der Mathematik und ihrer Grenzgebiete, IV, 3.

3. Lasker, E., "Zur Theorie der Moduln und Ideale," Mathematische Annalen, vol. 60 (1905).

4. Macaulay, F. S., "The theorem of residuation," Proceedings of the London Mathematical Society (1), vol. 31 (1900).

5. Macaulay, F. S., "Algebraic theory of modular systems," Cambridge Tracts in Mathematics, vol. 19 (1916).

6. Macaulay, F. S., "Modern algebra and polynomial ideals," Proceedings of the Cambridge Philosophical Society (1), vol. 30 (1934).

7. Ostrowski, A., "Über einige Lösungen der Funktionalgleichung φ(x)·φ(y) = φ(x·y)," Acta Mathematica, vol. 41 (1917).

8. Prüfer, H., "Untersuchungen über die Teilbarkeitseigenschaften in Körpern," Journal für reine und angewandte Mathematik, vol. 168 (1932).

9. van der Waerden, B. L., Moderne Algebra, vol. 2.

10. Zariski, O., "Algebraic surfaces," Ergebnisse der Mathematik und ihrer Grenzgebiete, III, 5.




INTEGRAL FORMS AND VARIATIONAL ORTHOGONALITY.* 


By PHILIP HARTMAN and RICHARD KERSHNER.


Introduction. The arc-length of a curve x_1 = x_1(t), x_2 = x_2(t), if it
exists, is defined¹ by an integral of the form

\[ L(x_1, x_2) = \int_a^b (dx_1^2 + dx_2^2)^{1/2}, \tag{1} \]

where (1) means the limit of approximating sums

\[ \sum_j \{[x_1(t_{j+1}) - x_1(t_j)]^2 + [x_2(t_{j+1}) - x_2(t_j)]^2\}^{1/2} \]

of the Riemann-Stieltjes type. It is well known that (1) exists whenever x_1
and x_2 are of bounded variation in [a, b]; and that, in case x_1 and x_2 are
absolutely continuous, (1) reduces to the ordinary Lebesgue integral

\[ L(x_1, x_2) = \int_a^b (x_1'^2 + x_2'^2)^{1/2}\, dt. \tag{2} \]

In this paper the integral (1) will be investigated in cases when x_1 and
x_2 are not absolutely continuous but possess, under the Lebesgue decomposition,
a purely singular or a purely discontinuous component. The results to be
obtained furnish means for the calculation of the length of curves given by a
parametric representation for which (2) is not valid. These results will be
based on the fact to be proved that, while (1) is clearly not decomposed linearly
under arbitrary linear decompositions of x_1, x_2, nevertheless, for the Lebesgue
decomposition, x_i(t) = a_i(t) + s_i(t) + p_i(t), where a_i is absolutely
continuous, s_i is purely singular,² and p_i is purely discontinuous, one has

\[ L(x_1, x_2) = L(a_1, a_2) + L(s_1, s_2) + L(p_1, p_2). \tag{3} \]

It should be noticed that, since L(a_1, a_2) reduces to an ordinary Lebesgue
integral and L(p_1, p_2) degenerates into an infinite sum, the usefulness of (3)
is limited largely by the difficulty of calculating L(s_1, s_2). Of course, in
special cases it may be possible to eliminate the parameter between s_1(t) and
s_2(t) and reduce this integral to one of the Lebesgue type. Additional results
which sometimes lead to the explicit evaluation of L(s_1, s_2) will be given in


* Received October 1, 1937.
¹ Cf., e.g., Saks [7], p. 57.
² A purely singular function will be supposed to be continuous.




206 PHILIP HARTMAN AND RICHARD KERSHNER. 


the sequel. In this direction the following fact may be mentioned here:
If, for every t in [a, b], either s_1'(t) = 0 or s_2'(t) = 0, then

\[ L(s_1, s_2) = V\{[a, b]; s_1\} + V\{[a, b]; s_2\}, \]

where V{[a, b]; f} is the total variation of f on [a, b].
In the non-parametric case, the length L(y) of the continuous curve
y = y(x) is given by


L(y) = Bat + — lat. 


In particular, if y(x) is purely singular and monotone in [a, b], 


L(y) =| y(b) —y(a) |+|b—a|. 


This last statement is trivial in the case that y(x) is almost everywhere
constant.
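
The formula L(y) = |y(b) − y(a)| + |b − a| can be checked numerically on the classical Cantor function on [0, 1], which is continuous, monotone and purely singular; the choice of this example is ours and is not taken from the text. The sketch below (function names hypothetical) sums the chord lengths of the inscribed polygon over the mesh of step 3^(-n); the sums should increase towards 1 + 1 = 2.

```python
import math
from fractions import Fraction


def cantor(x, depth=60):
    """Cantor function at a rational x in [0, 1] (exact for triadic rationals)."""
    x = Fraction(x)
    if x >= 1:
        return Fraction(1)
    y, scale = Fraction(0), Fraction(1, 2)
    for _ in range(depth):
        x *= 3
        digit = int(x)            # next ternary digit of x
        x -= digit
        if digit == 1:            # x lies in a removed middle third
            return y + scale
        if digit == 2:
            y += scale
        scale /= 2
        if x == 0:
            break
    return y


def polygon_length(n):
    """Length of the polygon inscribed in the graph over the mesh j / 3**n."""
    pts = 3 ** n
    total, prev = 0.0, cantor(Fraction(0))
    for j in range(1, pts + 1):
        cur = cantor(Fraction(j, pts))
        total += math.hypot(1 / pts, float(cur - prev))
        prev = cur
    return total


for n in (2, 4, 6):
    print(n, polygon_length(n))
```

On this mesh exactly 2^n intervals carry a rise of 2^(-n) and the remaining ones are flat, so the polygon length equals (1 + (2/3)^(2n))^(1/2) + 1 − (2/3)^n, which tends to 2 as n increases.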

At the suggestion of Professor Wintner the proofs of the facts mentioned
above have been extended so as to apply, not only to the Euclidean arc-length
(1), but to very general integral forms of the type

\[ \Phi(X) = \Phi\{[a, b); X\} = \int_a^b F(Y; dX), \tag{4} \]

where
Y = Y(t) = {y_1(t), ···, y_m(t)} and X = X(t) = {x_1(t), ···, x_n(t)}
are vector functions of t on [a, b] and where
F(Y; kZ) = kF(Y; Z), (k ≥ 0).


The best known integrals of this type are, of course, the Stieltjes integral

\[ \Phi(X) = \int_a^b y(t)\, dx(t) \tag{5} \]

and the total variation of the function x(t)

\[ \Phi(X) = V\{[a, b); x\} = \int_a^b |dx|. \tag{6} \]


Another simple example is the case

\[ \Phi(X) = \int_a^b \Big\{ \sum_{j=1}^{n} |dx_j|^p \Big\}^{1/p}, \qquad (p > 1), \tag{7} \]

of which the case p = 2 is Euclidean arc-length. The Riemannian arc-length

\[ \Phi(X) = \int_a^b \Big( \sum_{i,k=1}^{n} g_{ik}(X)\, dx_i\, dx_k \Big)^{1/2} \tag{8} \]

and the Finsler metrics

\[ \Phi(X) = \int_a^b F(X; dX) \tag{9} \]

can also be considered as special cases of (4).
In connection with Hilbert's theory of bounded quadratic forms with
continuous spectra, Hellinger [4] and Hahn [2] consider integrals of the form

\[ \Phi(X) = H_1(x_1, x_2) = \int_a^b \frac{(dx_1)^2}{dx_2} \tag{10} \]

and

\[ \Phi(X) = H_2(x_1, x_2) = \int_a^b \{ |dx_1\, dx_2| \}^{1/2}, \tag{11} \]

and need, in order to describe the system of orthogonal invariants, especially
a study of the case that X(t) is not absolutely continuous. The general results
to be obtained provide an essential simplification of the use of these integrals
in this connection.

In general, it is found, in conformity with (3), that if

\[ X(t) = X_1(t) + X_2(t) + X_3(t), \tag{12} \]

where X(t) is of bounded variation, X_1(t) is absolutely continuous, X_2(t) is
purely singular, and X_3(t) is purely discontinuous, then, under very general
conditions on the function F occurring in (4),

\[ \Phi(X) = \Phi(X_1) + \Phi(X_2) + \Phi(X_3). \tag{13} \]

In particular (13) holds when Φ(X) is any one of the explicit types (5), (6),
(7), (8), (10), (11). In the case of (9) some restrictions on F are, of course,
required.

Actually the Lebesgue decomposition (12) is only a special case of a large
class of decompositions for which the linear property exhibited in (13) is
demonstrated. Other decompositions, of a similar nature, are considered for
which (13) does not hold precisely but requires corrective terms.

Again in conformity with the case of Euclidean arc-length, it is shown
that the first integral Φ(X_1) occurring on the right side of (13) can be reduced,
in all cases for which (13) is proved, to an ordinary Lebesgue integral with
respect to t, namely

\[ \Phi(X_1) = \int_a^b F(Y(t); X'(t))\, dt, \qquad (X' = dX/dt). \]

Furthermore, the last integral, Φ(X_3), occurring on the right of (13)
degenerates to an ordinary series of the form

\[ \Phi(X_3) = \sum_i F(Y(p_i); X(p_i + 0) - X(p_i - 0)), \]

where the sum is taken over all discontinuity points p_i of X(t). Thus (13),
in a sense, reduces the investigation of integrals of type (4) to the case that
X(t) is purely singular, i. e., that all components of X(t) are purely singular.

The results to be obtained are, of course, trivial in the Stieltjes case (5)
and essentially known in the case (6) of total variation, but in all other
cases they seem to provide a new analysis of the integrals under consideration.
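
The additivity (13) can be observed numerically in a simple case of Euclidean arc-length. The sketch below is an illustration of our own, not an example from the paper: it takes X(t) = (t, t + H(t)) on [0, 1], where H is a left-continuous unit jump at t = 1/2, so that X_1(t) = (t, t) is the absolutely continuous part, X_3(t) = (0, H(t)) the purely discontinuous part, and no singular part is present. The names `polygon_sum`, `step`, `X`, `X1`, `X3` are hypothetical.

```python
import math


def polygon_sum(curve, a, b, p):
    """Sum of |curve(t_{j+1}) - curve(t_j)| over the uniform mesh with p intervals."""
    total, prev = 0.0, curve(a)
    for j in range(1, p + 1):
        cur = curve(a + (b - a) * j / p)
        total += math.hypot(cur[0] - prev[0], cur[1] - prev[1])
        prev = cur
    return total


def step(t):
    """Left-continuous unit jump at t = 1/2."""
    return 1.0 if t > 0.5 else 0.0


def X(t):
    return (t, t + step(t))


def X1(t):
    return (t, t)


def X3(t):
    return (0.0, step(t))


p = 100_000
whole = polygon_sum(X, 0.0, 1.0, p)
parts = polygon_sum(X1, 0.0, 1.0, p) + polygon_sum(X3, 0.0, 1.0, p)
print(whole, parts)
```

Here the absolutely continuous part contributes the Lebesgue integral of (1 + 1)^(1/2), the jump contributes the single term |X(1/2 + 0) − X(1/2 − 0)| = 1, and the sum for the whole curve approaches the same total as the mesh is refined.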


1. Definitions and conditions. The integral (4) may be defined as a
direct generalization of the Riemann-Stieltjes integral in the following manner:
Let a = t_0 < t_1 < ··· < t_{p+1} = b be a mesh on the interval [a, b]. Then
Φ(X) is the limit of an arbitrary sequence of sums

\[ \sum_{j=0}^{p} F(Y(\xi_j); \Delta_j X), \qquad \Delta_j X = X(t_{j+1}) - X(t_j), \quad t_j \le \xi_j < t_{j+1}, \tag{14} \]

associated with any sequence of meshes for which Δ = max |t_{j+1} − t_j| → 0,
provided this limit exists and is independent of the particular choice of the
sequence of meshes and of the intermediary values ξ_j. Correspondingly, the
integral (4) is said to exist if the sums (14) have such a unique limit. The
integral (4) will be said to exist absolutely if

\[ \int_a^b |F(Y; dX)|, \tag{15} \]

as well as (4), exists. The corresponding generalization of the
Lebesgue-Stieltjes type will not be considered here.
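
The sums (14) are straightforward to evaluate for a given mesh. The following is a minimal sketch, assuming that F is supplied as a Python function of two tuples (the value of Y and the increment of X) and that the left endpoint t_j is used as the intermediary value unless another rule is given; the names `integral_form_sum`, `uniform_mesh`, `euclid` and `stieltjes` are hypothetical.

```python
import math


def integral_form_sum(F, Y, X, mesh, xi=None):
    """Sum of F(Y(xi_j); X(t_{j+1}) - X(t_j)) over consecutive mesh points."""
    total = 0.0
    for t0, t1 in zip(mesh, mesh[1:]):
        point = t0 if xi is None else xi(t0, t1)   # intermediary value in [t0, t1)
        dX = tuple(b - a for a, b in zip(X(t0), X(t1)))
        total += F(Y(point), dX)
    return total


def uniform_mesh(a, b, p):
    return [a + (b - a) * j / p for j in range(p + 1)]


def euclid(y, z):
    """F for Euclidean arc-length; independent of Y, homogeneous of degree 1."""
    return math.hypot(z[0], z[1])


def stieltjes(y, z):
    """F for the Stieltjes integral of y with respect to x."""
    return y[0] * z[0]


mesh = uniform_mesh(0.0, math.pi / 2, 1000)
arc = integral_form_sum(euclid, lambda t: (), lambda t: (math.cos(t), math.sin(t)), mesh)
st = integral_form_sum(stieltjes, lambda t: (t,), lambda t: (t * t,), uniform_mesh(0.0, 1.0, 1000))
print(arc, st)
```

The first sum approximates the length π/2 of a quarter of the unit circle, and the second approximates the Stieltjes integral of t with respect to t² over [0, 1], whose value is 2/3.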

It will be convenient to list certain of the conditions which will be used 
in the sequel. 

Throughout the paper it will be supposed that X(t) is of bounded variation,
i.e., each component x_i(t) is of bounded variation, in [a, b]. It will
also be supposed that X(t) is continuous from the left. This involves no loss
of generality, since it necessitates, at most, the redefinition of X(t) at an
enumerable set of points, a process which does not affect the value of the
integral (4).

No restrictions will be imposed on Y(t) other than that its components 
be Baire functions. Even this requirement is unnecessary for many purposes. 

Of F(Y; Z), in addition to

(A) Positive linear homogeneity: F(Y; kZ) = kF(Y; Z), (k ≥ 0),
which has already been mentioned, it will always be supposed that F is
continuous in the n variables z_i together.




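Conditions which will be required not always but on occasion, are the
following:

(B) Polar symmetry: F(Y(t); −Z) = F(Y(t); Z).

(C) Convexity: F(Y(t); Z_1 + Z_2) ≤ F(Y(t); Z_1) + F(Y(t); Z_2).
In virtue of (A), for k = 1/2, this condition is equivalent to the usual notion
of convexity with respect to Z.

(D) For every ε > 0, one can choose a δ_ε > 0 such that

\[ \Big| \sum_{j=1}^{p} F(Y(\xi_j); Z_j) \Big| < \varepsilon \quad\text{if}\quad \sum_{j=1}^{p} \sum_{k=1}^{n} |z_{jk}| < \delta_\varepsilon, \]

where Z_j = {z_{j1}, ···, z_{jn}} and a ≤ ξ_j ≤ b.

(D') If Y(t) and X(t) are vector functions for which (4) exists, then
for every ε > 0, there exists a δ_ε > 0 such that

\[ \Big| \sum_{j=1}^{p} F(Y(\xi_j); \Delta_j X) \Big| < \varepsilon \quad\text{if}\quad \sum_{j=1}^{p} \sum_{k=1}^{n} |\Delta_j x_k| < \delta_\varepsilon, \]

where {[t_j, t'_j)} is any set of non-overlapping, half-open intervals on [a, b],
Δ_j x_k = x_k(t'_j) − x_k(t_j) and t_j ≤ ξ_j < t'_j.

(E) For every ε > 0, there exists a δ_ε > 0 such that

\[ \Big| \sum_{j=1}^{p} \{ F(Y(\xi_j); Z_j) - F(Y(\xi_j); W_j) \} \Big| < \varepsilon \quad\text{if}\quad \sum_{j=1}^{p} \sum_{k=1}^{n} |z_{jk} - w_{jk}| < \delta_\varepsilon, \]

where Z_j = {z_{j1}, ···, z_{jn}}, W_j = {w_{j1}, ···, w_{jn}}, and a ≤ ξ_j ≤ b.

(E') If Y(t), X_1(t), X_2(t) are vector functions for which \int_a^b F(Y; dX_1)
and \int_a^b F(Y; dX_2) exist, then one can choose, for every ε > 0, a δ_ε > 0
such that

\[ \Big| \sum_{j=1}^{p} \{ F(Y(\xi_j); \Delta_j X_1) - F(Y(\xi_j); \Delta_j X_2) \} \Big| < \varepsilon \quad\text{if}\quad \sum_{j=1}^{p} \sum_{k=1}^{n} |\Delta_j x_{1k} - \Delta_j x_{2k}| < \delta_\varepsilon, \]

the notations being analogous to those used in (D').
The relations between these last four conditions are indicated by the
following table of implications,

(E) → (E'),   (D) → (D'),   (E) → (D),   (E') → (D'),

where, e.g., (E) → (E') means that (E') is satisfied whenever (E) is.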
Condition (D) is somewhat weaker than a Lipschitz condition on F( Y(t) ; Z) 




with respect to the n variables z_i at the point z_i = 0, which is uniform with
respect to the parameter t in the t-interval [a, b]. Correspondingly, (E) is
satisfied if F(Y(t); Z) fulfills a uniform Lipschitz condition in the n
independent variables z_i.


2. Φ(X) as a set function. For later purposes it will be convenient
to extend the definition of the integral Φ(X) = Φ{[a, b); X} to a set function
Φ{S; X}, defined for all Borel sets S in [a, b]. It is known³ that if the
point function f(u; X) = Φ{[a, u); X} exists for every u in a < u ≤ b, is of
bounded variation, and continuous from the left in a < u ≤ b, then there
exists a unique, bounded, completely additive set function Φ{S; X}, defined
for all Borel sets S in [a, b], such that if S is the half-open interval [a, u),
a ≤ t < u, then Φ{S; X} = f(u; X).

This set function Φ{S; X} may be obtained in the following way: Let

\[ f(u; X) = f_1(u; X) - f_2(u; X), \]

where f_1 and f_2 are non-decreasing functions of u in [a, b] and let

\[ \Phi_i\{S; X\} = \inf \sum_k [f_i(u'_k) - f_i(u_k)], \qquad (i = 1, 2), \]

where a ≤ u_k < u'_k ≤ b and S ⊂ Σ_k [u_k, u'_k). Then

\[ \Phi\{S; X\} = \Phi_1\{S; X\} - \Phi_2\{S; X\}. \]

When such an extension is possible, the notation

\[ \Phi\{S; X\} = \int_S F(Y; dX) \tag{16} \]

will be used, in conformity with (4).
Now there will be proved 


THEOREM I. If (4) exists absolutely and F satisfies condition (D'), then

\[ f(u; X) = \int_a^u F(Y; dX) \tag{17} \]

exists for every u in a < u ≤ b and there exists a completely additive set
function (16) defined for all Borel sets in [a, b] such that

\[ \Phi\{[a, u); X\} = \int_a^u F(Y; dX). \]


Proof. The fact that (17) exists for every $u$ in $[a, b]$ is, of course, a


³ Cf., e.g., Radon [5], pp. 1305-1313.


INTEGRAL FORMS AND VARIATIONAL ORTHOGONALITY. 211 


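consequence of the existence of $f(b; X)$. In fact, for every $\epsilon > 0$, one can
choose a $\Delta_\epsilon > 0$ such that if

(18)  $a = t_0 < t_1 < \cdots < t_{p+1} = b$

is any mesh for which the degree of fineness $\Delta$ satisfies

(19)  $\Delta = \max | t_{j+1} - t_j | < \Delta_\epsilon$,

then

(20)  $\Big| \sum_{j=0}^{p} F(Y(\xi_j); \Delta_j X) - \int_a^b F(Y; dX) \Big| < \epsilon$

for arbitrary intermediary values $\xi_j$. Now if

$a = t_0 < t_1 < \cdots < t_{p+1} = u$
and
$a = t'_0 < t'_1 < \cdots < t'_{q+1} = u$

are any two meshes having degrees of fineness $\Delta$, $\Delta'$ respectively, which satisfy
(19), and if both are continued by one and the same mesh
$u = \tau_0 < \cdots < \tau_{r+1} = b$ of fineness less than $\Delta_\epsilon$, then

$\Big| \sum_{j=0}^{p} F(Y(\xi_j); \Delta_j X) + \sum_{i=0}^{r} F(Y(\eta_i); \Delta_i X) - \int_a^b F(Y; dX) \Big| < \epsilon$,

and the same inequality holds with the first sum replaced by the corresponding
sum over the second mesh. Consequently,

$\Big| \sum_{j=0}^{p} F(Y(\xi_j); \Delta_j X) - \sum_{j=0}^{q} F(Y(\xi'_j); \Delta'_j X) \Big| < 2\epsilon$

whenever $\Delta < \Delta_\epsilon$ and $\Delta' < \Delta_\epsilon$. Thus (17) exists for every $u$ in $[a, b]$.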

As was mentioned at the beginning of this section, in order to complete
the proof of Theorem I, it is sufficient to show that the function (17) is of
bounded variation and continuous from the left on $a < u \le b$. That $f(u; X)$
is of bounded variation may be seen from the decomposition

$f(u; X) = \tfrac{1}{2} \int_a^u \{ | F(Y; dX) | + F(Y; dX) \} - \tfrac{1}{2} \int_a^u \{ | F(Y; dX) | - F(Y; dX) \}$

of $f$ into the difference of two monotone functions. The existence of the
integrals on the right side is assured by the assumption of the existence of
(4) and (15), as has been shown above.

Since $X(t)$ is of bounded variation and continuous from the left, for
every $u$ in $a < u \le b$, there exists an $\eta = \eta_\epsilon(u)$ such that the variation

$V\{[u', u); X\} = \int_{u'}^{u} | dX | < \delta_\epsilon$  if  $u - u' < \eta$,

where $\delta_\epsilon$ is defined in (D'). In virtue of (D'), it is seen for any mesh
$u' = t_0 < t_1 < \cdots < t_{p+1} = u$, that

$\Big| \sum_{j=0}^{p} F(Y(\xi_j); \Delta_j X) \Big| < \epsilon$,  since  $\sum_{j=0}^{p} \sum_{k=1}^{n} | \Delta_j x_k | \le V\{[u', u); X\} < \delta_\epsilon$.

This implies that

$| f(u; X) - f(u'; X) | = \Big| \int_{u'}^{u} F(Y; dX) \Big| \le \epsilon$,

completing the proof of Theorem I.
Obvious modifications of the proof of the last part of Theorem I show
that if $S$ is any Borel set for which $V\{S; X\} = 0$, then $\Phi\{S; X\} = 0$. In
particular one has

THEOREM II. Under the assumptions of Theorem I, the functions
$f(u; X)$, $\Phi\{S; X\}$ are absolutely continuous, purely singular, or purely
discontinuous with $X(t)$.

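Similarly, the first part of Theorem I can easily be extended to show that
if $S$ is a set consisting of a finite number of disjoint half-open intervals
$[u_j, u'_j)$, $j = 1, \cdots, s$, and if

$u_j = t_{j0} < t_{j1} < \cdots < t_{j, p_j + 1} = u'_j$,  $(j = 1, \cdots, s)$,

is a mesh over $S$ such that the degree of fineness $\Delta$ satisfies $\Delta < \Delta_\epsilon$, then

$\Big| \sum_{j=1}^{s} \sum_{k=0}^{p_j} F(Y(\xi_{jk}); X(t_{j,k+1}) - X(t_{jk})) - \int_S F(Y; dX) \Big| < 2\epsilon$.

Use of this fact will be made in the proof of Theorem III.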
3. Reduction to other forms. It has been shown that in case $X(t)$
is absolutely continuous, the integral (4) reduces in many cases to an ordinary
Lebesgue integral. The precise formulation of the result to be proved in this
direction is

THEOREM III. If the assumptions of Theorem I are satisfied and if
$X(t)$ is absolutely continuous, then

(22)  $\int_a^b F(Y; dX) = \int_a^b F(Y(t); X'(t)) \, dt$.
For the proof of this theorem, the following lemma will be needed. 


LEMMA 1. If the assumptions of Theorem III are satisfied, then for any
Borel set $T$ on which $X'(t)$ exists and

(23)  $\lambda \le F(Y(t); X'(t)) < \mu$

one has

(24)  $\lambda \, \text{meas} \, T \le \Phi\{T; X\} \le \mu \, \text{meas} \, T$.


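Proof. By Theorem II, the function $\Phi\{S; X\}$ is absolutely continuous,
i.e., for every $\epsilon > 0$, one can choose an $\eta_\epsilon > 0$ such that $| \Phi\{S; X\} | < \epsilon$
when $S$ is any Borel set for which meas $S < \eta_\epsilon$.

Now, since the set $T$ occurring in the statement of the lemma is a
bounded set, the covering theorem of Vitali implies that there exists a set
$U = \sum_{j=1}^{s} [u_j, u'_j)$ such that

(25)  $u'_j - u_j < \Delta_\epsilon$;
(26)  meas $(T - TU) < \eta_\epsilon$;  meas $(U - TU) < \eta_\epsilon$;
(27)  $|$ meas $T$ $-$ meas $U$ $| < \epsilon$;
(28)  $| X(u'_j) - X(u_j) - X'(\xi_j) [u'_j - u_j] | < \ldots$,  $u_j \le \xi_j < u'_j$.

Now, by (28) and the property (A) of $F$, one has

(29)  $(\lambda - \epsilon)(u'_j - u_j) \le F(Y(\xi_j); X(u'_j) - X(u_j)) < (\mu + \epsilon)(u'_j - u_j)$.

By adding the inequalities (29), one obtains

(30)  $(\lambda - \epsilon) \, \text{meas} \, U \le \sum_{j=1}^{s} F(Y(\xi_j); X(u'_j) - X(u_j)) < (\mu + \epsilon) \, \text{meas} \, U$.

In virtue of (25) and the remark made immediately after Theorem II,

(31)  $\Big| \sum_{j=1}^{s} F(Y(\xi_j); X(u'_j) - X(u_j)) - \Phi\{U; X\} \Big| < 2\epsilon$.

Also

(32)  $| \Phi\{T; X\} - \Phi\{U; X\} | < 2\epsilon$

by (26) and the definition of $\eta_\epsilon$. Finally, by (27), (30), (31), and (32),

$(\lambda - \epsilon)(\text{meas} \, T - \epsilon) - 4\epsilon \le \Phi\{T; X\} < (\mu + \epsilon)(\text{meas} \, T + \epsilon) + 4\epsilon$.

The inequality (24) now follows since $\epsilon > 0$ is arbitrary.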
Proof of Theorem III. Consider a lower and an upper Lebesgue approximating
sum for the function $F(Y(t); X'(t))$. Then, by Lemma 1,

(33)  $\sum_{i=-\infty}^{\infty} \lambda_i \, \text{meas} \, S_i \le \sum_{i=-\infty}^{\infty} \Phi\{S_i; X\} = \Phi\{[a, b); X\}$

and

(34)  $\sum_{i=-\infty}^{\infty} \lambda_{i+1} \, \text{meas} \, S_i \ge \sum_{i=-\infty}^{\infty} \Phi\{S_i; X\} = \Phi\{[a, b); X\}$,

where $S_i = S\{\lambda_i \le F < \lambda_{i+1}\}$ is the set of points $t$ in $[a, b]$ for which
$\lambda_i \le F(Y(t); X'(t)) < \lambda_{i+1}$ (sets of the form $S\{\lambda \le F < \mu\}$ have an analogous
meaning). The inequalities (33) and (34) imply that the integral occurring
on the right of (22) exists and that (22) holds.

By Theorem II, if $X(t)$ is purely discontinuous then $\Phi\{S; X\}$ is a purely
discontinuous set function. In this case, also, the integral has a simple form.

THEOREM IV. If the assumptions of Theorem I are satisfied and if
$X(t)$ is purely discontinuous, then

$\int_a^b F(Y; dX) = \sum_i F(Y(p_i); X(p_i + 0) - X(p_i - 0))$,

where the sum extends over all discontinuity points $p_i$ of $X(t)$.

Proof. By the remark immediately following the proof of Theorem I,
it is seen that

(35)  $\int_a^b F(Y; dX) = \int_{\sum p_i} F(Y; dX) = \sum_i \int_{p_i} F(Y; dX)$.

But, by definition,

(36)  $\int_{p_i} F(Y; dX) = \lim_{\epsilon \to 0} \int_{p_i - \epsilon}^{p_i + \epsilon} F(Y; dX)$
        $= \lim_{\epsilon \to 0} F(Y(p_i); X(p_i + \epsilon) - X(p_i - \epsilon))$.

Theorem IV is a consequence of these relations.
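As a purely illustrative numerical check of Theorem IV (not part of the original argument), the following Python sketch takes the Euclidean integrand $F(Y; Z) = |Z|$ treated in Section 10 and a hypothetical left-continuous step function $X(t)$ on $[0, 1]$. The jump points, the jump vectors and all helper names are assumptions made for this example. Once the mesh is fine enough to separate the jumps, the approximating sum should coincide with the sum of $F$ over the jumps.

```python
import math

# Hypothetical data: jump points p_i of X(t) and the jump vectors X(p_i+0) - X(p_i-0).
JUMPS = {0.25: (3.0, 4.0), 0.5: (-1.0, 0.0), 0.8: (0.0, 2.0)}

def euclid(z):
    """Assumed integrand F(Y; Z) = |Z|; it does not depend on Y."""
    return math.hypot(z[0], z[1])

def X(t):
    """Left-continuous step function: sum of all jumps located at points p < t."""
    x1 = sum(dz[0] for p, dz in JUMPS.items() if p < t)
    x2 = sum(dz[1] for p, dz in JUMPS.items() if p < t)
    return (x1, x2)

def mesh_sum(n):
    """Approximating sum of F over the uniform mesh t_j = j/n on [0, 1]."""
    total = 0.0
    for j in range(n):
        lo, hi = X(j / n), X((j + 1) / n)
        total += euclid((hi[0] - lo[0], hi[1] - lo[1]))
    return total

def jump_sum():
    """Right-hand side of Theorem IV for this integrand."""
    return sum(euclid(dz) for dz in JUMPS.values())

if __name__ == "__main__":
    print(mesh_sum(2), mesh_sum(1000), jump_sum())
```

With only two mesh intervals two of the jumps fall into the same interval, so the coarse sum is smaller than the jump sum; the fine mesh reproduces it.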


4. Variational orthogonality. Two functions $x_1(t)$, $x_2(t)$ of bounded
variation on $a \le t \le b$ will be said to be variationally orthogonal on $[a, b]$
if there exists a Borel set $S$ such that

(37_1)  $V\{S; x_1\} = V\{[a, b]; x_1\}$;  (37_2)  $V\{S; x_2\} = 0$,

where, as above, $V\{S; x\} = \int_S | dx |$ is the total variation of the function
$x$ on the set $S$. (It is clear that the seeming lack of symmetry of this definition
with respect to $x_1$ and $x_2$ is only apparent.) For example, if $x_1(t)$ is
purely singular and if $x_2(t)$ is absolutely continuous, then $x_1$ and $x_2$ are
variationally orthogonal and $S$ may be chosen to be a zero set satisfying (37_1).
Similarly, if $x_1(t)$ is purely discontinuous and if $x_2(t)$ is continuous, then
$x_1$ and $x_2$ are variationally orthogonal and $S$ may be chosen to be the enumerable




set of points of discontinuity of $x_1(t)$. Finally let $x(t)$ be any function of
bounded variation and let

(38)  $x_1(t) = \tfrac{1}{2} \int_a^t \{ | dx | + dx \}$

be the positive variation of $x$ on $[a, t)$ and

(39)  $x_2(t) = x_1(t) - x(t) = \tfrac{1}{2} \int_a^t \{ | dx | - dx \}$,

the negative variation of $x$ on $[a, t)$, so that,

(40)  $x(t) = x_1(t) - x_2(t)$

is Jordan's decomposition of $x(t)$ into the difference of two non-decreasing
functions. Then $x_1(t)$ and $x_2(t)$ are variationally orthogonal and $S$ may be
chosen to be the set where $x'(t) \ge 0$ (including $x'(t) = +\infty$).
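A discrete analogue may make the orthogonality of the two Jordan parts concrete. The sketch below is an illustration only: the sample values and the function name `jordan_parts` are assumptions, and a finite list of samples stands in for a function of bounded variation. It accumulates the positive and negative variations step by step; on every step at most one of the two parts changes, which is the discrete counterpart of choosing $S$ as the set where $x$ increases.

```python
def jordan_parts(samples):
    """Cumulative positive and negative variations of a sampled function.

    Discrete analogue of (38) and (39): each increment d is split into
    max(d, 0) and max(-d, 0).
    """
    pos, neg = [0.0], [0.0]
    for prev, cur in zip(samples, samples[1:]):
        d = cur - prev
        pos.append(pos[-1] + max(d, 0.0))
        neg.append(neg[-1] + max(-d, 0.0))
    return pos, neg

# Hypothetical sample values x(t_0), ..., x(t_5).
samples = [0.0, 2.0, 1.0, 1.0, 4.0, 3.0]
pos, neg = jordan_parts(samples)

# Steps on which the positive part grows form the discrete set S.
S = [j for j in range(len(samples) - 1) if pos[j + 1] > pos[j]]
var_pos_on_S = sum(pos[j + 1] - pos[j] for j in S)
var_neg_on_S = sum(neg[j + 1] - neg[j] for j in S)

if __name__ == "__main__":
    print(pos, neg, S, var_pos_on_S, var_neg_on_S)
```

Here the positive part carries all of its variation on `S`, while the negative part has no variation there, in line with (37_1) and (37_2).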

The two vector functions

(41)  $X_1(t) = \{x_{11}(t), \cdots, x_{1n}(t)\}$ and $X_2(t) = \{x_{21}(t), \cdots, x_{2n}(t)\}$

will be said to be variationally orthogonal if $x_{1j}(t)$ and $x_{2j}(t)$ are variationally
orthogonal for every $j$. The two vector functions (41) will be said to be
completely variationally orthogonal if there exists a Borel set $S$ for which

(42_1)  $V\{S; X_1\} = V\{[a, b]; X_1\}$;  (42_2)  $V\{S; X_2\} = 0$,

i.e., for $j = 1, \cdots, n$,

(43_1)  $V\{S; x_{1j}\} = V\{[a, b]; x_{1j}\}$;  (43_2)  $V\{S; x_{2j}\} = 0$.

Thus, if $X_1(t)$ is purely singular and $X_2(t)$ is absolutely continuous,
then $X_1(t)$ and $X_2(t)$ are completely variationally orthogonal and $S$ may be
chosen to be a zero set satisfying (42_1). The same remarks hold if $X_1(t)$ is
purely discontinuous and $X_2(t)$ is continuous. However, if $X(t)$ is an arbitrary
vector function of bounded variation and if $X(t) = X_1(t) - X_2(t)$
represents the decomposition (38), (39), (40) of its components into monotone
parts, then $X_1(t)$ and $X_2(t)$ are variationally orthogonal but need not
be completely so.


5. Linearity. The property of complete variational orthogonality discussed
above is exactly what is needed to insure that the integral (4) satisfies
the decomposition property exhibited in (13).

THEOREM V. Suppose that (4) exists absolutely and that F satisfies
condition (E). Let $X(t) = X_1(t) + X_2(t)$, where $X_1$ and $X_2$ are completely
variationally orthogonal. Then

(44)  $\int_a^b F(Y; dX) = \int_a^b F(Y; dX_1) + \int_a^b F(Y; dX_2)$,

the existence of the last two integrals being proved.


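Proof. Since the conditions of Theorem I are satisfied, (16) exists and
is a completely additive set function.
Let $S$ denote the set satisfying (42). Let $\epsilon > 0$ be fixed. Let
$U = U_\epsilon = \sum_{j=1}^{\infty} [u_j, u'_j)$ be an enumerable set of pairwise disjoint, half-open
intervals which contains $S$. By the definition of $\Phi\{S; X\}$, it is clear that $U$
may be chosen so that

(45)  $| \Phi\{S; X\} - \Phi\{U_\epsilon; X\} | < \epsilon$;

while from (42_2),

(46)  $V\{U_\epsilon; X_2\} = \sum_{k=1}^{n} V\{U_\epsilon; x_{2k}\} < \delta_\epsilon$,

where $\delta_\epsilon$ is defined in (E). Now, since $\Phi$, $V$ are completely additive set functions,

(47)  $\Phi\{U_\epsilon; X\} = \sum_{j=1}^{\infty} \Phi\{[u_j, u'_j); X\}$;  $V\{U_\epsilon; X\} = \sum_{j=1}^{\infty} V\{[u_j, u'_j); X\}$.

Thus, it is possible to select a sufficiently large integer $N = N_\epsilon$, such that

(48)  $\Big| \Phi\{U_\epsilon; X\} - \sum_{j=1}^{N} \Phi\{[u_j, u'_j); X\} \Big| < \epsilon$

and at the same time,

(49)  $\sum_{j=N+1}^{\infty} \sum_{k=1}^{n} V\{[u_j, u'_j); x_{1k}\} < \delta_\epsilon$.

Let $a = t_0 < t_1 < \cdots < t_{p+1} = b$ be a mesh on $[a, b]$ such that each $u_j$ and
each $u'_j$, for $j = 1, \cdots, N$, occurs among the $t_k$. Let the degree of fineness $\Delta$
of this mesh be less than the number $\Delta_\epsilon$ defined in (19) and (20). It is clear
that the requirement that $u_j$ and $u'_j$ occur among the $t_k$ is no essential restriction
on the generality of this mesh. Let $\sum'_k$ denote the sum over those $k$-values
for which the interval $[t_k, t_{k+1})$ lies in an interval $[u_j, u'_j)$, $j \le N$. Let $\sum''_k$
denote the sum over all remaining $k$-values. Then

(50)  $\Big| \sum_k F(Y(\xi_k); \Delta_k X_1) - \sum'_k F(Y(\xi_k); \Delta_k X) \Big|$
        $\le \Big| \sum'_k \{ F(Y(\xi_k); \Delta_k X_1) - F(Y(\xi_k); \Delta_k X) \} \Big| + \Big| \sum''_k F(Y(\xi_k); \Delta_k X_1) \Big|$.

Now, by (46),

(51)  $\sum'_k \sum_{j=1}^{n} | \Delta_k x_{1j} - \Delta_k x_j | = \sum'_k \sum_{j=1}^{n} | \Delta_k x_{2j} | \le V\{U_\epsilon; X_2\} < \delta_\epsilon$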




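and, by (49) and (42_1),

(52)  $\sum''_k \sum_{j=1}^{n} | \Delta_k x_{1j} | \le \sum_{j=N+1}^{\infty} \sum_{k=1}^{n} V\{[u_j, u'_j); x_{1k}\} + V\{[a, b] - U_\epsilon; X_1\} < \delta_\epsilon$.

It follows, from (50), (51), (52) and the property (E) of $F$, that

(53)  $\Big| \sum_k F(Y(\xi_k); \Delta_k X_1) - \sum'_k F(Y(\xi_k); \Delta_k X) \Big| < 2\epsilon$.

In virtue of the remark made following Theorem II and the fact that $\Delta < \Delta_\epsilon$,

(54)  $\Big| \sum'_k F(Y(\xi_k); \Delta_k X) - \sum_{j=1}^{N} \Phi\{[u_j, u'_j); X\} \Big| < 2\epsilon$.

Addition of the inequalities (45), (48), (53), and (54) gives

(55)  $\Big| \sum_k F(Y(\xi_k); \Delta_k X_1) - \Phi\{S; X\} \Big| < 6\epsilon$  if  $\Delta < \Delta_\epsilon$.

Since the mesh employed is arbitrary, (55) shows that

(56)  $\int_a^b F(Y; dX_1) = \Phi\{S; X\}$

exists. In exactly the same way, it can be shown that

(57)  $\int_a^b F(Y; dX_2) = \Phi\{[a, b] - S; X\}$

exists. Since $\Phi$ is an additive set function, (56) and (57) together imply (44).
This completes the proof of Theorem V.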
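The role of complete variational orthogonality in Theorem V can be seen on a toy case. The sketch below is an illustration under stated assumptions: it uses the Euclidean integrand $F(Y; Z) = |Z|$ of Section 10, hypothetical purely discontinuous vectors described only by their jumps, and the jump-sum form of Theorem IV to evaluate the integrals. When $X_1$ and $X_2$ jump at different points a separating set $S$ exists and the integral is additive; when they jump at the same point no such $S$ exists and additivity fails.

```python
import math

def jump_integral(jumps):
    """Jump-sum form of the integral for F(Y; Z) = |Z| and a step function.

    `jumps` maps each jump point to its jump vector (assumed data format).
    """
    return sum(math.hypot(dx, dy) for dx, dy in jumps.values())

def add_steps(j1, j2):
    """Jumps of X1 + X2, given the jumps of X1 and of X2."""
    out = dict(j1)
    for p, (dx, dy) in j2.items():
        ox, oy = out.get(p, (0.0, 0.0))
        out[p] = (ox + dx, oy + dy)
    return out

# Hypothetical examples.
X1 = {0.3: (1.0, 0.0)}
X2_separate = {0.7: (0.0, 1.0)}   # S = {0.3} carries X1 and no variation of X2
X2_shared = {0.3: (0.0, 1.0)}     # same jump point: no separating Borel set

additive = jump_integral(add_steps(X1, X2_separate))
not_additive = jump_integral(add_steps(X1, X2_shared))
parts = jump_integral(X1) + jump_integral(X2_separate)

if __name__ == "__main__":
    print(additive, not_additive, parts)
```

In the shared case the two unit jumps combine into a single jump of length $\sqrt{2}$, strictly less than the sum of the separate integrals.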
The proof of Theorem V implies

THEOREM VI. Suppose that (4) exists absolutely and that F satisfies
condition (E'). Let $X(t) = X_1(t) + X_2(t)$, where $X_1$ and $X_2$ are completely
variationally orthogonal. Then

$\int_a^b F(Y; dX) = \int_a^b F(Y; dX_1) + \int_a^b F(Y; dX_2)$,

whenever the last two integrals exist.

In fact, condition (E) was used in the proof of Theorem V only in connection
with the inequalities (50), (51), (52), (53) and it is obviously possible
to use condition (E') in the same way if the existence of $\int_a^b F(Y; dX_1)$
and $\int_a^b F(Y; dX_2)$ is assured.

THEOREM VII. Suppose that (4) exists and that F satisfies conditions
(B), (C), and (D). Let $X(t) = X_1(t) + X_2(t)$, where $X_1$ and $X_2$ are
completely variationally orthogonal. Then

$\int_a^b F(Y; dX) = \int_a^b F(Y; dX_1) + \int_a^b F(Y; dX_2)$,

the existence of the last two integrals being proved.

Proof. In virtue of Theorem V, it is sufficient to prove that conditions
(B), (C), and (D) imply condition (E) and the existence of (15). Now,
by (C),

$F(Y(t); Z) - F(Y(t); W) \le F(Y(t); Z - W)$

and

$F(Y(t); W) - F(Y(t); Z) \le F(Y(t); W - Z)$.

Thus, by (B),

(58)  $| F(Y(t); Z) - F(Y(t); W) | \le F(Y(t); Z - W) = F(Y(t); W - Z)$.

This inequality (58) shows, first of all, that $F(Y(t); Z) \ge 0$ for all $Z$, so
that existence coincides with absolute existence. Also (58) and the condition
(D) imply (E).

THEOREM VIII. Suppose that (4) exists and that F satisfies conditions
(B), (C) and (D'). Let $X(t) = X_1(t) + X_2(t)$, where $X_1$ and $X_2$ are
completely variationally orthogonal. Then

$\int_a^b F(Y; dX) = \int_a^b F(Y; dX_1) + \int_a^b F(Y; dX_2)$,

whenever the last two integrals exist.

Proof. Examination of the proof of Theorems V and VI shows that
condition (E') is required only in the following forms,

$\Big| \sum_j \{ F(Y(\xi_j); \Delta_j X) - F(Y(\xi_j); \Delta_j X_1) \} \Big| < \epsilon$  if  $\sum_j \sum_{k=1}^{n} | \Delta_j x_{2k} | < \delta_\epsilon$,

$\Big| \sum_j \{ F(Y(\xi_j); \Delta_j X) - F(Y(\xi_j); \Delta_j X_2) \} \Big| < \epsilon$  if  $\sum_j \sum_{k=1}^{n} | \Delta_j x_{1k} | < \delta_\epsilon$.

Thus, the required formulae corresponding to (58) are

(59)  $| F(Y(\xi); \Delta X) - F(Y(\xi); \Delta X_1) | \le F(Y(\xi); \Delta X_2)$,
      $| F(Y(\xi); \Delta X) - F(Y(\xi); \Delta X_2) | \le F(Y(\xi); \Delta X_1)$.

These formulae (59) are, obviously, consequences of (B) and (C). But (59)
shows that the form of (E') which is actually used in the proof of Theorem VI
is implied by (D'). Thus Theorem VIII is a consequence of the proof of
Theorem VI.




COROLLARY 1. Let $X(t) = X_1(t) + X_2(t) + X_3(t)$, where $X_1(t)$ is
absolutely continuous, $X_2(t)$ is purely singular, and $X_3(t)$ is purely discontinuous.
Then, under the assumptions of any of Theorems V-VIII,

(60)  $\int_a^b F(Y; dX) = \int_a^b F(Y; dX_1) + \int_a^b F(Y; dX_2) + \int_a^b F(Y; dX_3)$.

It is understood that the existence of all three integrals on the right of
(60) is either supposed or proved as in the corresponding theorem.


6. Variationally orthogonal decompositions. It has been seen that
if the vector $X(t)$ is decomposed into the sum of completely variationally
orthogonal vectors then, in a large number of cases, (4) is correspondingly
decomposed linearly. In case $X(t)$ is decomposed into the sum of two vectors
which are variationally orthogonal but not necessarily completely so, the
behavior of (4) is described by the following theorems.

THEOREM IX. Suppose that $\int_a^b F(Y; dX) = \int_a^b F(Y; dx_1, \cdots, dx_n)$
and $\int_a^b F(Y; 0, dx_2, \cdots, dx_n)$ exist absolutely. Suppose that F satisfies the
following condition: For every $\epsilon > 0$, there exists a $\delta > 0$, such that

(61)  $\Big| \sum_j \{ F(Y(\xi_j); z_{j1}, z_{j2}, \cdots, z_{jn}) - F(Y(\xi_j); w_{j1}, z_{j2}, \cdots, z_{jn}) \} \Big| < \epsilon$  if  $\sum_j | z_{j1} - w_{j1} | < \delta$.

Let $x_1(t) = x_{11}(t) + x_{21}(t)$, where $x_{11}$, $x_{21}$ are variationally orthogonal. Then

(62)  $\int_a^b F(Y; dX) = \int_a^b F(Y; dx_{11}, dx_2, \cdots, dx_n)$
        $+ \int_a^b F(Y; dx_{21}, dx_2, \cdots, dx_n) - \int_a^b F(Y; 0, dx_2, \cdots, dx_n)$,

the existence of the first two integrals on the right being proved.

Theorem IX may be proved by an argument which is a modification of
the proof of Theorem V and which will not be given here. It is found that
the modification (61) of the condition (E) is all that is required in this case.
A similar modification of the conditions (B), (C), (D), (D'), and (E'), in
such a way that they apply only to the particular variable under consideration,
yields theorems which are analogous to Theorems VI, VII, VIII in the same
way as Theorem IX is analogous to Theorem V.

Repeated applications of Theorem IX to successive variables yield

THEOREM X. Suppose that (4) exists absolutely. Suppose that F satisfies
condition (E). Let $X(t) = X_1(t) + X_2(t) + X_3(t)$, where $X_1$ and $X_2$
are variationally orthogonal vectors and $X_3 \equiv 0$. Then

(63)  $\int_a^b F(Y; dX) = \sum_{i_1=1}^{3} \cdots \sum_{i_n=1}^{3} (-1)^{k(i_1, \cdots, i_n)} \int_a^b F(Y; dx_{i_1 1}, \cdots, dx_{i_n n})$

(where $k(i_1, \cdots, i_n) = \sum_j [i_j/3]$ = number of values $j$ for which $i_j = 3$)
provided all the integrals occurring on the right of (63) exist absolutely.


7. The Hellinger integral.⁴ The Hellinger integral is the special case
of (4) for which $F = z_1^2 / | z_2 |$ so that the integral has the form

(64)  $\Phi(X) = H_1(x_1, x_2) = \int_a^b (dx_1)^2 / | dx_2 |$.

In dealing with these integrals it is always assumed that $x_2(t)$ is monotone
non-decreasing in the interval $[a, b]$, so that the absolute value sign in the
denominator of (64) is unnecessary, but it has been included here in order to
make $F$ satisfy condition (B). It is assumed that every interval of constancy
of $x_2(t)$ is an interval of constancy of $x_1(t)$. Finally, the undetermined
fraction 0/0 is defined to be 0.
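For orientation, the approximating sums of (64) are easy to compute in a smooth case. The sketch below is only an illustration: the choice $x_1(t) = t^2$, $x_2(t) = t$ on $[0, 1]$, the uniform mesh and the function name `hellinger_sum` are assumptions made for the example. For these functions the sum over $n$ equal intervals equals $4/3 - 1/(3n^2)$, so it increases with $n$ towards $\int_0^1 (2t)^2 \, dt = 4/3$, the value given by the reduction (22) to a Lebesgue integral.

```python
def hellinger_sum(x1, x2, n, a=0.0, b=1.0):
    """Approximating sum of (64) over the uniform mesh with n intervals on [a, b]."""
    total = 0.0
    for j in range(n):
        t0 = a + (b - a) * j / n
        t1 = a + (b - a) * (j + 1) / n
        d1 = x1(t1) - x1(t0)
        d2 = x2(t1) - x2(t0)
        if d2 == 0.0:
            # Convention 0/0 = 0; x1 is assumed constant wherever x2 is.
            continue
        total += d1 * d1 / abs(d2)
    return total

def x1(t):
    return t * t   # hypothetical numerator function

def x2(t):
    return t       # hypothetical non-decreasing denominator function

if __name__ == "__main__":
    for n in (10, 20, 1000):
        print(n, hellinger_sum(x1, x2, n))
```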

The following known properties of (64) will be needed.

(i)⁵ In order that (64) exist, it is necessary and sufficient that there
exist a function $h(t)$, monotone non-decreasing in $[a, b]$, such that, if
$a \le t < t' \le b$, then

(65)  $[x_1(t') - x_1(t)]^2 \le [x_2(t') - x_2(t)] [h(t') - h(t)]$,

or, symbolically,

(65)  $(\Delta x_1)^2 \le (\Delta x_2)(\Delta h)$.

(ii)⁶ Let $x_1(t) = g_1(t) - g_2(t)$ represent the decomposition (38), (39),
(40) of $x_1(t)$ into the difference of two non-decreasing, variationally orthogonal
functions. Then, if (64) exists, $H_1(g_1, x_2)$ and $H_1(g_2, x_2)$ exist and

$H_1(x_1, x_2) = H_1(g_1, x_2) + H_1(g_2, x_2)$.


⁴ Cf. Hellinger [4], Hahn [2], and Svenson [8].

⁵ Cf. Hellinger [4], p. 26 f. or Svenson [8], p. 4.

⁶ Svenson [8], Theorem IV, p. 29. This statement is also a consequence of one of
the modifications of Theorem IX above and, in fact, holds for any decomposition of
$x_1(t)$ into variationally orthogonal parts.




(iii)⁷ If $x_1(t)$ is non-decreasing and if (64) exists, then the function
$h(t)$ in (i) may be chosen so that $x_1(t)$ is a basis⁸ of $h(t)$.

Now it will be shown that Theorems I, II, III, IV, and VIII are
applicable to the Hellinger integral by proving

LEMMA 2. The Hellinger integral (64) satisfies conditions (B), (C)
and (D').


Proof. Since conditions (B) and (C) are obviously satisfied by
$F = (z_1)^2 / | z_2 |$ it is only required to prove that (D') also is satisfied.

Let $X(t) = \{x_1(t), x_2(t)\}$ be a vector function for which (64) exists.
In view of (ii) it is sufficient to treat the case that $x_1(t)$, as well as $x_2(t)$,
is non-decreasing. In this case, (i) and (iii) show that there exists a non-decreasing
function $h(t)$, of which $x_1(t)$ is a basis, satisfying (65). Now,
it is known⁹ that if $x_1(t)$ is a basis of $h(t)$, then for every $\epsilon > 0$, there exists
a $\delta_\epsilon > 0$ such that

$\sum_{j=1}^{\infty} \Delta_j h < \epsilon$  whenever  $\sum_{j=1}^{\infty} \Delta_j x_1 < \delta_\epsilon$.

Now suppose that $\sum_{j=1}^{\infty} [\Delta_j x_1 + \Delta_j x_2] < \delta_\epsilon$. Then,

$\sum_{j=1}^{\infty} (\Delta_j x_1)^2 / \Delta_j x_2 \le \sum_{j=1}^{\infty} \Delta_j h < \epsilon$,

since $\Delta_j x_1 \le \Delta_j x_1 + \Delta_j x_2$. But this is exactly condition (D').

Thus it is seen, in particular, that the Hellinger integral is additive under
completely variationally orthogonal decompositions $X = X_1 + X_2$, whenever
$\Phi(X_1)$ and $\Phi(X_2)$ exist. It will now be shown that this is always the case.

LEMMA 3. Suppose that (64) exists where $x_i(t)$, $i = 1, 2$, are non-decreasing.
Let $x_i(t) = x_{1i}(t) + x_{2i}(t)$, where $x_{ji}(t)$, $i, j = 1, 2$, are non-decreasing
and where the vectors $X_1 = \{x_{11}, x_{12}\}$ and $X_2 = \{x_{21}, x_{22}\}$ are
completely variationally orthogonal. Then $H_1(x_{11}, x_{12})$ and $H_1(x_{21}, x_{22})$ exist.

Proof. Since $H_1(x_1, x_2)$ exists, by (i), there exists a monotone non-decreasing
function $h(t)$ satisfying (65). By (65) and Schwarz's inequality

$\sum_j \Delta_j x_1 \le \sum_j (\Delta_j x_2)^{1/2} (\Delta_j h)^{1/2} \le \{ \sum_j \Delta_j x_2 \}^{1/2} \{ \sum_j \Delta_j h \}^{1/2}$,

so that, if $S$ is any Borel set,


⁷ Svenson [8], pp. 11-12.

⁸ A monotone function $b(t)$ is called a basis of the function $f(t)$ if $V\{S; f\} = 0$
whenever $V\{S; b\} = 0$. Thus, $h(t)$ is clearly a basis of $x_1(t)$ by (65). Cf. Radon [5],
p. 1318.

⁹ Cf. Radon [5], p. 1319.




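(66)  $V\{S; x_1\}^2 \le V\{S; x_2\} \, V\{S; h\}$.

Now by the definition (43) of complete variational orthogonality, there exists
a set $T$ such that

$V\{T; x_{1i}\} = V\{[a, b); x_{1i}\}$ and $V\{T; x_{2i}\} = 0$.

Thus, if $[t, t')$ is any half-open interval in $[a, b)$,

(67)  $V\{[t, t')T; x_{1i}\} = V\{[t, t'); x_{1i}\}$

and

(68)  $V\{[t, t')T; x_i\} = V\{[t, t'); x_{1i}\}$.

In particular, (67) and (68) give

$V\{[t, t'); x_{11}\} = V\{[t, t')T; x_1\}$,

or, using (66),

(69)  $V\{[t, t'); x_{11}\}^2 \le V\{[t, t')T; x_2\} \, V\{[t, t')T; h\}$.

But, by (68),

(70)  $V\{[t, t')T; x_2\} = V\{[t, t'); x_{12}\}$.

Together, (69) and (70) imply

$V\{[t, t'); x_{11}\}^2 \le V\{[t, t'); x_{12}\} \, V\{[t, t'); h\}$

or

$(\Delta x_{11})^2 \le (\Delta x_{12})(\Delta h)$.

Hence, by (i), $H_1(x_{11}, x_{12})$ exists. The existence of $H_1(x_{21}, x_{22})$ is shown
similarly.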

Some of the results of this section are summarized, for the particular case
of the Lebesgue decomposition, in

COROLLARY 2. Suppose that (64) exists. Let

$x_i(t) = a_i(t) + s_i(t) + p_i(t)$, $(i = 1, 2)$,

where $a_i$ is absolutely continuous, $s_i$ is purely singular, and $p_i$ is purely
discontinuous. Then

$H_1(x_1, x_2) = H_1(a_1, a_2) + H_1(s_1, s_2) + H_1(p_1, p_2)$

or

$\int_a^b (dx_1)^2 / dx_2 = \int_a^b [a_1'(t)]^2 / a_2'(t) \, dt + \int_a^b (ds_1)^2 / ds_2$
        $+ \sum_i [x_1(p_i + 0) - x_1(p_i - 0)]^2 / [x_2(p_i + 0) - x_2(p_i - 0)]$,

where the last sum is taken over all discontinuity points $p_i$ of $x_2(t)$.




It may be mentioned that the results of this section also apply to the
generalisation,¹⁰ $\int_a^b | dx_1 |^p / | dx_2 |^{p-1}$, $p > 1$, of Hellinger's integral.


8. The Hilbert integral.¹¹ The Hilbert integral is the special case of
(4) for which $F = | z_1 z_2 |^{1/2}$, so that the integral has the form

(71)  $\Phi(X) = H_2(x_1, x_2) = \int_a^b | dx_1 \, dx_2 |^{1/2}$.

This integral exists whenever $X(t) = \{x_1(t), x_2(t)\}$ is of bounded variation.
It will now be shown that Theorems I, II, III, IV and VI are applicable to
this integral by proving

LEMMA 4. The Hilbert integral (71) satisfies condition (E').


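Proof. Let $X_i(t) = \{x_{i1}(t), x_{i2}(t)\}$, $i = 1, 2$, be vectors of bounded variation,
so that $H_2(x_{11}, x_{12})$ and $H_2(x_{21}, x_{22})$ exist. Let $M = \max_{i,j=1,2} V\{[a, b); x_{ij}\}$.
Now, from

$| (a + b)(c + d) |^{1/2} \le | ac |^{1/2} + | ad |^{1/2} + | bc |^{1/2} + | bd |^{1/2}$,

one obtains

(72)  $| \Delta x_{11} \Delta x_{12} |^{1/2} - | \Delta x_{21} \Delta x_{22} |^{1/2} \le | \Delta x_{11} - \Delta x_{21} |^{1/2} | \Delta x_{22} |^{1/2}$
        $+ | \Delta x_{12} - \Delta x_{22} |^{1/2} | \Delta x_{21} |^{1/2} + | \Delta x_{11} - \Delta x_{21} |^{1/2} | \Delta x_{12} - \Delta x_{22} |^{1/2}$,

by letting $\Delta x_{11} = a + b$, $\Delta x_{12} = c + d$, $\Delta x_{21} = a$ and $\Delta x_{22} = c$. Using (72)
and the Schwarz inequality, one has

(73)  $\sum_j \{ | \Delta_j x_{11} \Delta_j x_{12} |^{1/2} - | \Delta_j x_{21} \Delta_j x_{22} |^{1/2} \} \le 4 \{ M \sum_j \sum_{k=1}^{2} | \Delta_j x_{1k} - \Delta_j x_{2k} | \}^{1/2}$,

where $\Delta_j x_{ik} = x_{ik}(t'_j) - x_{ik}(t_j)$ and $\{[t_j, t'_j)\}$ is a set of non-overlapping
half-open intervals on $[a, b]$.

By repeating the above argument with the subscripts 1 and 2 interchanged,
it is seen that (73) still holds if the absolute value of the first sum is taken.
Thus

(74)  $\Big| \sum_{j=1}^{\infty} \{ | \Delta_j x_{11} \Delta_j x_{12} |^{1/2} - | \Delta_j x_{21} \Delta_j x_{22} |^{1/2} \} \Big| < \epsilon$  if  $\sum_{j=1}^{\infty} \sum_{k=1}^{2} | \Delta_j x_{1k} - \Delta_j x_{2k} | < \delta_\epsilon$,

where $\delta_\epsilon = \epsilon^2 / 16M$. But (74) is precisely condition (E').

Recalling the fact that condition (E') implies condition (D') it is seen
that the theorems mentioned above are applicable to the Hilbert integral (71).
Again restating the particular case of the Lebesgue decomposition, one has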


¹⁰ F. Riesz [6].
¹¹ Hellinger [4], p. 31 f.




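COROLLARY 3. Let $x_i(t) = a_i(t) + s_i(t) + p_i(t)$, $i = 1, 2$, where $a_i$ is
absolutely continuous, $s_i$ is purely singular, and $p_i$ is purely discontinuous.
Then

$H_2(x_1, x_2) = H_2(a_1, a_2) + H_2(s_1, s_2) + H_2(p_1, p_2)$

or

$\int_a^b | dx_1 \, dx_2 |^{1/2} = \int_a^b | a_1'(t) \, a_2'(t) |^{1/2} \, dt + \int_a^b | ds_1 \, ds_2 |^{1/2}$
        $+ \sum_i | [x_1(p_i + 0) - x_1(p_i - 0)] [x_2(p_i + 0) - x_2(p_i - 0)] |^{1/2}$,

where the last sum is taken over all common discontinuity points $p_i$ of $x_1(t)$
and $x_2(t)$.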


9. Riemannian arc length. The length of curves in Riemannian spaces
is given by the integral

(75)  $\int_a^b \{ \sum_{i,k=1}^{n} g_{ik} \, dx_i \, dx_k \}^{1/2}$,

where $g_{ik}(t) = g_{ik}(Y(t))$ is the matrix of a positive definite
quadratic form. It is assumed that $g_{ik}(t)$ is continuous on $[a, b]$. Then the
integral (75) exists whenever $X(t) = \{x_1(t), \cdots, x_n(t)\}$ is of bounded variation.
It will be shown that Theorems I, II, III, IV, VII, IX and X are
applicable to these integrals (75) by proving


LEMMA 5. The integral (75) satisfies conditions (B), (C) and (D).

Proof. First, the condition (B) is obviously satisfied.
To see that condition (D) is satisfied, let $M = \max | g_{ik}(t) |$ for
$a \le t \le b$; $i, k = 1, \cdots, n$. Then

$\{ \sum_{i,k=1}^{n} g_{ik}(t) \, z_i z_k \}^{1/2} \le M^{1/2} \sum_{i=1}^{n} | z_i |$.

Finally, the inequality required in (C) is known¹² to hold in this case.


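COROLLARY 4. Let $x_i(t) = a_i(t) + s_i(t) + p_i(t)$, $i = 1, \cdots, n$, where
$a_i$ is absolutely continuous, $s_i$ is purely singular and $p_i$ is purely discontinuous.
Then

$\int_a^b \{ \sum_{i,k=1}^{n} g_{ik} \, dx_i \, dx_k \}^{1/2} = \int_a^b \{ \sum_{i,k=1}^{n} g_{ik} \, a_i'(t) a_k'(t) \}^{1/2} dt + \int_a^b \{ \sum_{i,k=1}^{n} g_{ik} \, ds_i \, ds_k \}^{1/2}$
        $+ \sum_j \{ \sum_{i,k=1}^{n} g_{ik}(p_j) [x_i(p_j + 0) - x_i(p_j - 0)] [x_k(p_j + 0) - x_k(p_j - 0)] \}^{1/2}$.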


¹² This inequality is equivalent to inequality 29 in Hardy, Littlewood and Pólya
[3], p. 33.




10. Euclidean arc length. The ordinary Euclidean arc length is, of
course, the special case of the above for which $g_{ik} = \delta_{ik}$. However, the results
obtained take such a simple form, in this case, that it seems worthwhile to
enumerate them. For simplicity of statement, these results will be formulated
in the case $n = 2$.

COROLLARY 5. Let $x_i(t) = a_i(t) + s_i(t) + p_i(t)$, $i = 1, 2$, where $a_i$ is
absolutely continuous, $s_i$ is purely singular, and $p_i$ is purely discontinuous.
Then

$\int_a^b \{ (dx_1)^2 + (dx_2)^2 \}^{1/2} = \int_a^b \{ [a_1'(t)]^2 + [a_2'(t)]^2 \}^{1/2} dt + \int_a^b \{ (ds_1)^2 + (ds_2)^2 \}^{1/2}$
        $+ \sum_i \{ [x_1(p_i + 0) - x_1(p_i - 0)]^2 + [x_2(p_i + 0) - x_2(p_i - 0)]^2 \}^{1/2}$,

where the last sum is taken over all discontinuity points $p_i$ of $x_1(t)$ or of $x_2(t)$.
COROLLARY 6. Suppose $S(t) = \{s_1(t), s_2(t)\}$ is a continuous vector
function of bounded variation, such that, for every $t$ on $[a, b]$, either $s_1'(t) = 0$
or $s_2'(t) = 0$. Then

$\int_a^b \{ (ds_1)^2 + (ds_2)^2 \}^{1/2} = \int_a^b | ds_1 | + \int_a^b | ds_2 |$.

In fact, in the decomposition $S = S_1 + S_2$, where $S_1(t) = \{s_1(t), 0\}$ and
$S_2(t) = \{0, s_2(t)\}$, the two vectors $S_1$ and $S_2$ are completely variationally
orthogonal.


COROLLARY 7. The length of the curve $y = y(x)$, where $y(x)$ is a continuous
function of bounded variation in $[a, b]$, is given by

$\int_a^b \{ 1 + [y'(x)]^2 \}^{1/2} dx + \int_a^b | ds_2 |$.

This is a consequence of Corollary 5, for in this case $a_1(t) = t$,
$a_2(t) = \int_a^t y'(u) \, du$, $s_1(t) = 0$, $s_2(t) = y(t) - a_2(t)$.


COROLLARY 8. The length of the curve $y = y(x)$, where $y(x)$ is a purely
singular monotone function in $[a, b]$, is given by

$| y(b) - y(a) | + | b - a |$.
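Corollary 8 can be watched numerically on the classical Cantor function, which is continuous, monotone and purely singular on $[0, 1]$, so that the predicted length is $1 + 1 = 2$. The sketch below is an illustration only; the function names and the choice of inscribed polygons with vertices at the points $k / 3^n$ are assumptions made for the example. For this choice the polygon length is $\{1 + (2/3)^{2n}\}^{1/2} + 1 - (2/3)^n$, which increases to 2.

```python
import math

def cantor(k, n):
    """Value of the Cantor function at x = k / 3**n, for 0 <= k <= 3**n."""
    if k == 3 ** n:
        return 1.0
    value = 0.0
    for i in range(1, n + 1):
        digit = (k // 3 ** (n - i)) % 3   # i-th ternary digit of x
        if digit == 1:
            # The function is constant on the removed middle third.
            return value + 2.0 ** -i
        value += (digit // 2) * 2.0 ** -i
    return value

def polygon_length(n):
    """Length of the polygon inscribed in the graph with vertices at k / 3**n."""
    h = 3.0 ** -n
    total = 0.0
    prev = cantor(0, n)
    for k in range(1, 3 ** n + 1):
        cur = cantor(k, n)
        total += math.hypot(h, cur - prev)
        prev = cur
    return total

if __name__ == "__main__":
    for n in (2, 4, 8):
        print(n, polygon_length(n))
```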


COROLLARY 9. Let $x_1(t) = g(t) - h(t)$ be the decomposition (38),
(39), (40) of $x_1(t)$ into the difference of two monotone non-decreasing,
variationally orthogonal functions. Then

$\int_a^b \{ (dx_1)^2 + (dx_2)^2 \}^{1/2} = \int_a^b \{ (dg)^2 + (dx_2)^2 \}^{1/2}$
        $+ \int_a^b \{ (dh)^2 + (dx_2)^2 \}^{1/2} - V\{[a, b); x_2\}$.

This corollary is a consequence of Theorem VIII.


COROLLARY 10. Let $x_i(t) = g_i(t) - h_i(t)$ be the decomposition of $x_i(t)$
into the difference of two monotone non-decreasing variationally orthogonal
functions. Then

$\int_a^b \{ (dx_1)^2 + (dx_2)^2 \}^{1/2} = \int_a^b \{ (dg_1)^2 + (dg_2)^2 \}^{1/2} + \int_a^b \{ (dg_1)^2 + (dh_2)^2 \}^{1/2}$
        $+ \int_a^b \{ (dh_1)^2 + (dg_2)^2 \}^{1/2} + \int_a^b \{ (dh_1)^2 + (dh_2)^2 \}^{1/2}$
        $- V\{[a, b); x_1\} - V\{[a, b); x_2\}$.

This corollary is a consequence of Theorem IX.
In Corollaries 9 and 10 it is clear that the variational orthogonality of
$g_i$ and $h_i$ is all that is needed to insure the result.

THE JOHNS HOPKINS UNIVERSITY,
UNIVERSITY OF WISCONSIN.

BIBLIOGRAPHY.

[1] P. Finsler, "Über Kurven und Flächen in allgemeinen Räumen," Dissertation,
Göttingen (1918).

[2] H. Hahn, "Über die Integrale des Herrn Hellinger und die Orthogonalinvarianten
der quadratischen Formen von unendlich vielen Veränderlichen," Monatshefte
für Mathematik und Physik, vol. 23 (1912), pp. 161-224.

[3] G. H. Hardy, J. E. Littlewood, and G. Pólya, Inequalities, Cambridge (1934).

[4] E. Hellinger, "Die Orthogonalinvarianten quadratischer Formen von unendlichvielen
Variablen," Dissertation, Göttingen (1907).

[5] J. Radon, "Theorie und Anwendungen der absolut additiven Mengenfunctionen,"
Sitzungsberichte der Kaiserlichen Akademie der Wissenschaften, Wien, Math.-Nat.
Klasse, vol. 122 (1913), Abth. 2a, pp. 1295-1438.

[6] F. Riesz, "Untersuchungen über Systeme integrierbarer Funktionen," Mathematische
Annalen, vol. 69 (1910), pp. 449-497.

[7] S. Saks, Théorie de l'intégrale, Warsaw (1933).

[8] E. Svenson, "Beiträge zur Theorie gewisser Integraltypen," Abhandlungen der
Herder-Gesellschaft und des Herder-Instituts zu Riga, vol. 5 (1936), no. 6.


ON TRANSLATIONS IN GENERAL PLANE GEOMETRIES.* 


By HERBERT BUSEMANN. 


In a well-known paper, Hilbert¹ has characterized the Euclidean and
hyperbolic plane geometries by mere group and continuity axioms. He gets
all the motions at once by requiring the existence of sufficiently many rotations.
The present paper tries to point out how the existence of more and more
translations gradually specializes the rather general metric it starts with to a
Desarguesian geometry (Minkowskian or hyperbolic).

We require our initial space to be a "Geradenraum" in Menger's terminology.²
The exact definition will be found in section 1. It is essentially a
metric space with exactly one shortest line (s. l.) of infinite length passing
through two given points. We therefore call a Geradenraum an SL space.
In order to be able to formulate the results, we must have the concept of an
asymptote. If $g$ is any shortest line, $P$ a point not on $g$, and $Q$ a point
traversing $g$ in a certain direction $g^+$ ($g^-$), the s. l. connecting $P$ and $Q$ always
tends to a limit s. l., which we call an asymptote to $g^+$ ($g^-$), preserving the word
parallel for the case where the two asymptotes to $g^+$ and $g^-$ through $P$ coincide.
Section 1 gives those properties of these asymptotes and of limit circles which
we shall need later on.

A two-dimensional SL space $\Sigma$ will be called a plane since it is homeomorphic
to the Euclidean plane.³ As usual, we say the metric of $\Sigma$ is
Desarguesian if it is possible to map $\Sigma$ topologically on the Euclidean plane
or a convex part $K$ of it in such manner that the shortest lines are transformed
into straight lines or into the intersections of straight lines with $K$.⁴ By a
motion of $\Sigma$ we mean any one-to-one mapping of $\Sigma$ onto itself which preserves
distance and, in particular, by a translation along a shortest line $g$ we mean a
motion which transforms each of the half planes of $\Sigma$ defined by $g$ into itself.

We first assume that to each pair A, B of points on a fixed s.1. g there 


* Received April 19, 1937.

¹ Reprinted in [5], Anhang IV. The numbers [n] refer to the references on p. 256.

² [7] especially pp. 100-113.

³ See [2].

⁴ A proof for the fact that the validity of Desargues' Theorem is necessary and
sufficient for the existence of such a mapping, can be found in [11] §§ 3, 4.




exists a translation along $g$ carrying $A$ into $B$; these translations form a
group $G$. In section 2 we ascertain which of the usual properties of translations
hold, and show by examples that others do not. The main result is that
the images of a fixed point $A$ under the translations of $G$ form a curve which
together with $g$ bounds a convex domain. It follows then, for instance, that a
parallel $h$ to $g$ is equidistant from $g$ and that the translations along $g$ can also
be regarded as translations along $h$; but such a geometry is not necessarily
Desarguesian, even if the parallel axiom holds throughout the plane. On the
other hand, it can be Desarguesian with the parallel axiom holding only with
respect to $g$.

The availability of $G$ does not imply the existence of translations along
a non-parallel asymptote to $g^+$ (or $g^-$). By assuming their presence one therefore
gets a much more specialized metric (section 3), in which translations
along each asymptote $a'$ to $g^+$ exist and the hyperbolic formula for the arc of
the limit circle to the direction $g^+$ between $a'$ and $g$ holds. Nevertheless an
example will show that the geometry is not necessarily Desarguesian.
However, as soon as one requires the existence of translations along two
s. l. where the one is neither an asymptote nor parallel to the other, we get a
Desarguesian metric. We find, namely, (section 4) that the metric is either
Minkowskian⁵ or hyperbolic. In both cases there exist translations along
each straight line.


1. Asymptotes and limit spheres in SL spaces. The exact definition
of an SL space is as follows: A complete metric space with the distance function
$r(x, y)$ is an SL space if it satisfies the conditions:

I. To each pair of points $A$, $B$ there exists exactly one point $C$, the center
of $A$ and $B$, for which $r(A, C) + r(C, B) = r(A, B)$, $r(A, C) = r(C, B)$.

II. To each pair of distinct points $A$, $B$ there exist exactly two points
$D$ and $D'$ such that $B$ is the center of $A$ and $D$ and $A$ the center of $B$ and $D'$.
All points $X$ satisfying the relation

$r(A, X) + r(X, B) = r(A, B)$

form a point set (designated by $\overline{AB}$ or $\overline{BA}$), which is homeomorphic to a
Euclidean straight line segment. If $X \subset \overline{AB}$ and $X \ne A$, $X \ne B$ we say


⁵ The original definition is in [8], chapter I, compare also p. 234 of this paper.
⁶ loc. cit.




that $X$ is between $A$ and $B$. The set of points $Y$ (resp. $Z$) for which $B$ is
between $A$ and $Y$ (resp. $A$ between $B$ and $Z$) together with $\overline{AB}$ is homeomorphic
to a Euclidean ray and will be designated by $\overrightarrow{AB}$, resp. $\overrightarrow{BA}$. We put

$\overleftrightarrow{AB} = \overrightarrow{AB} + \overrightarrow{BA}$

and call $\overleftrightarrow{AB}$ a shortest line (s. l.). $\overleftrightarrow{AB}$ is homeomorphic to a Euclidean
straight line. Then the following theorem holds:

(1.1) Through two different points passes exactly one s. l. Hence two different
s. l. intersect in at most one point.


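As usual we put for any two point sets $\alpha$, $\beta$

$r(\alpha, \beta) = \text{g.l.b.} \; r(X, Y)$,  $X \subset \alpha$, $Y \subset \beta$.

We say that the point sets $\alpha_\nu$ converge to the set $\alpha$ if $\alpha$ contains all limit
points of sequences $\{P_\nu\}$ with $P_\nu \subset \alpha_\nu$ and if to any point $O$, any positive
number $r$, and any $\epsilon > 0$, a number $N(\epsilon, O, r)$ can be found such that for
$\nu > N$ and each point $X \subset \alpha$ with $r(X, O) < r$ the inequality

$r(X, \alpha_\nu) < \epsilon$

holds. Then we have

(1.2) If $A_\nu \to A$ and $B_\nu \to B$, then $\overline{A_\nu B_\nu} \to \overline{AB}$; and if $A \ne B$, also
$\overrightarrow{A_\nu B_\nu} \to \overrightarrow{AB}$, $\overrightarrow{B_\nu A_\nu} \to \overrightarrow{BA}$, $\overleftrightarrow{A_\nu B_\nu} \to \overleftrightarrow{AB}$.

If, for a given point $P$, a point $Q \subset \alpha$ exists with

$r(P, Q) = r(P, \alpha)$,

we call $Q$ a foot of $P$ on $\alpha$. Then the following lemma is an immediate consequence
of the triangle inequality.

(1.3) If $R_\nu \to R$, $\alpha_\nu \to \alpha$, and $T_\nu$ is a foot of $R_\nu$ on $\alpha_\nu$, then each accumulation
point $T$ of $T_\nu$ is a foot of $R$ on $\alpha$.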


Let g be any s.l., $P \not\subset g$, $A \subset g$, and X a variable point on g.
$r(X, A) \to \infty$ implies $r(P, X) \to \infty$; hence, P has at least one foot Q on g.
A point R between P and Q has Q as its only foot on g; for, from


$$r(R, Q') \le r(R, Q), \qquad Q' \subset g, \quad Q' \neq Q,$$


230 HERBERT BUSEMANN. 


would follow
$$r(P, Q') < r(P, Q) = r(P, g).$$


We shall have to use later the fact: 


(1.4) In an SL space of dimension greater than 1, there exist to each s.l.
g points whose distance from g is arbitrarily great.


To prove this we show that a sphere with radius a whose center C is on
g⁷ contains points P with $r(P, g) \ge a/2$. Let $B_1$, $B_2$ be the points on g with
$r(B_1, C) = r(B_2, C) = a$, $B'_1$ the center of $B_1C$, $B'_2$ the center of $B_2C$ and
A an arbitrary point not on g. We draw $B_1A$ and $B_2A$ and consider the set $\sigma$
formed by all rays CX with $X \subset B_1A + AB_2$. Let K be the intersection of
the sphere with $\sigma$; K is homeomorphic to a Euclidean semicircle. The feet
on g of a point $P \subset K$ near $B_i$ are near $B_i$ (i = 1, 2). If one had
$r(P, g) < a/2$ for all $P \subset K$, no foot on g of a point $P \subset K$ could belong to
$B'_1B'_2$. It is easy to see that there must then be a point $P_0$ with two feet $Q_1$, $Q_2$
on g such that $Q_1 \subset B'_1B_1$, $Q_2 \subset B'_2B_2$ and one would have


$$a > r(P_0, Q_1) + r(P_0, Q_2) \ge r(Q_1, Q_2) \ge a.$$
(1.4), together with the preceding statement, gives:


(1.5) In an SL space of dimension greater than 1, to a given s.l. g and a
given N > 0 points P can be found with a unique foot on g and r(P, g) = N.


We now consider an oriented s.l. g and write $Q < Q'$ for two points Q
and Q' on g if Q' follows Q. If $P < Q < Q'$ it follows from the triangle
inequality that

(1.6) $K(Q, P) \cdot K(Q', P) = P$

($\alpha \cdot \beta$ means the product of the point sets $\alpha$ and $\beta$) where K(X, Z) means
the sphere with the center X and through Z, i.e. with the radius r(X, Z).
More exactly, $K(Q, P) - P$ lies in the interior of $K(Q', P)$. From this we
conclude: if $P < Q_1 < Q_2 < \cdots$ and $r(P, Q_\nu) \to \infty$, the spheres $K(Q_\nu, P)$
tend to a limit set L(P, g), which does not depend on the choice of the
sequence $\{Q_\nu\}$.⁸ We call L(P, g) the limit sphere through P with or to the
center ray g. In terms of the metric of the space, L(P, g) can be characterized
as follows: it consists of those points R for which

(1.7) $r(Q_\nu, P) - r(Q_\nu, R) \to 0$

for each sequence $Q_1 < Q_2 < \cdots$ with $r(P, Q_\nu) \to \infty$.


7 By the sphere with radius a and center C we mean the set of points Z with
$r(C, Z) = a$.

8 For detailed proofs of this and the following statements see [3] § 2.

One proves with the help of (1.7) that the limit spheres with the center 
ray g cover the space. L(P,g) decomposes the space into two parts, the in- 
terior and the exterior of L(P, g), the former consisting of those points which, 
for P < Q and sufficiently large r(P, Q), are in the interior of K(Q, P). 
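
As an illustration of the metric characterization (1.7), the following minimal sketch evaluates the quantity r(Q, P) - r(Q, R) in the ordinary Euclidean plane, with the positive x-axis taken as the center ray g. The choice of the Euclidean metric and the names `euclid` and `limit_sphere_defect` are assumptions made only for this example; they do not appear in the paper. In this special case the limit sphere through P is the straight line through P perpendicular to g, so the quantity should tend to 0 exactly for points R with the same x-coordinate as P.

```python
import math


def euclid(p, q):
    """Euclidean distance between two points of the plane."""
    return math.hypot(p[0] - q[0], p[1] - q[1])


def limit_sphere_defect(p, r, t):
    """Return r(Q_t, P) - r(Q_t, R) for Q_t = (t, 0) on the positive x-axis.

    By (1.7), R lies on the limit sphere through P exactly when this
    quantity tends to 0 as t grows without bound.
    """
    q = (t, 0.0)
    return euclid(q, p) - euclid(q, r)


P = (0.0, 0.0)
on_line = (0.0, 3.0)   # same x-coordinate as P: expected on the limit sphere
off_line = (1.0, 3.0)  # different x-coordinate: expected off the limit sphere

for t in (1e2, 1e4, 1e6):
    print(t, limit_sphere_defect(P, on_line, t), limit_sphere_defect(P, off_line, t))
```

For the second point the quantity tends to the difference of the x-coordinates rather than to 0, which is the Euclidean instance of the statement that the limit spheres with the same center ray are equidistant from each other.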

We call the point set $\mu$ equidistant from the point set $\nu$ if, for any two
points $P_1$, $P_2$ of $\mu$ one has


$$r(P_1, \nu) = r(P_2, \nu).$$


If $\mu$ is equidistant from $\nu$, then $\nu$ is not necessarily equidistant from $\mu$.
But for limit spheres one has 


(1.8) Limit spheres with the same center ray g are equidistant from each 
other. Hence exactly one limit sphere with g as center ray passes through a 
given point. 

With the help of these results we can prove a lemma which will be applied 
frequently in the sequel. 


(1.9) Lemma. If the s.l. s and a intersect in T then the parts of a and s
outside any sphere around T with positive radius have positive distance.


Let us assume, on the contrary, that there exists a sequence $\{R^*_\nu\}$ on a
and a sequence $\{S^*_\nu\}$ on s, such that


$$r(R^*_\nu, S^*_\nu) \to 0, \quad \text{but} \quad r(R^*_\nu, T) \ge \delta > 0.$$
Then we must have $r(R^*_\nu, T) \to \infty$.
Let A be a point on a different from T. Either on AT or on TA are
infinitely many points $R_\nu$ of $\{R^*_\nu\}$; suppose they are on TA, and call $\{S_\nu\}$
the subsequence of $\{S^*_\nu\}$ corresponding to $\{R_\nu\}$. The two limit spheres
L(A, TA) and L(A, AT) have no common interior point. Since $r(S_\nu, R_\nu) \to 0$,
the s.l. s intersects both these limit spheres, the former at B, say, and the
latter at B', with $B' \subset TB$. On account of (1.6) the sphere K(T, A) intersects
the segment TB' in a point B'' between T and B'; hence, $r(T, B) > r(T, A)$.
On the other hand, we have $| r(B, S_\nu) - r(B, R_\nu) | \le r(R_\nu, S_\nu)$ and


$$r(B, R_\nu) - r(A, R_\nu) \to 0 \quad \text{(see (1.7))}.$$


Hence
$$r(B, S_\nu) - [r(A, R_\nu) + r(R_\nu, S_\nu)] \to 0$$
and


$$r(T, S_\nu) - [r(T, R_\nu) + r(R_\nu, S_\nu)] \to r(T, B) - r(T, A) > 0,$$


which for large $\nu$ contradicts the triangle inequality.
From reference [3], section 2, we take the following theorem: 


(1.10) An arbitrary point A has a unique foot on each limit sphere. The
feet of A on the different limit spheres with the same center ray g form an s.l. a.


We call a the asymptote to g through A. Two asymptotes to g are either
identical or disjoint. In order to justify the use of this term we show


(1.11) If $Q^*_\nu \subset g$, $Q^*_1 < Q^*_2 < \cdots$, and $r(Q^*_1, Q^*_\nu) \to \infty$, then $AQ^*_\nu$
tends to the asymptote a to g through A.


Proof. Let L(P, g) be the limit sphere with the center ray g through A,
and $\{Q_\nu\}$ a subsequence of $\{Q^*_\nu\}$ for which $AQ_\nu$ converges to an s.l. b. (That
$\{Q_\nu\}$ exists follows from (1.2).) We have to prove that b coincides with a.
For $r(A, Q_\nu) > 1$ we can choose the points $R_\nu$ and $T_\nu$ on $AQ_\nu$ such that
$r(A, R_\nu) = 1$ and $r(Q_\nu, P) = r(Q_\nu, T_\nu)$ (that $r(Q_\nu, A) > r(Q_\nu, P)$ follows
from (1.1)). The points $R_\nu$ tend to a point R on b with $r(A, R) = 1$; R is
not on L(P, g). $T_\nu$ is the foot of $R_\nu$ on $K(Q_\nu, P)$. We therefore conclude
from (1.3) and $K(Q_\nu, P) \to L(P, g)$ that each limit point of $\{T_\nu\}$ must be
a foot of R on L(P, g). (1.10) shows that R has only one foot T on L(P, g),
namely the point where the asymptote RT to g through R intersects L(P, g).


Therefore,
$$b = \lim AQ_\nu = \lim R_\nu T_\nu = RT.$$


Hence b is an asymptote to g and from $b \cdot a \supset A$ follows b = a.

We can formulate our results thus: 
(1.12) If Q traverses the s.l. g in a certain direction $\vec{g}$ and $P \not\subset g$, then
PQ tends to a limit s.l. a, the asymptote to $\vec{g}$ through P. The asymptote
to $\vec{g}$ through a point of a coincides with a. A converging sequence of
asymptotes to $\vec{g}$ tends to an asymptote to $\vec{g}$.


From now on we suppose that our SL space, indicated by $\Sigma$, has
dimension 2. It is therefore homeomorphic to the Euclidean plane. The spheres




are homeomorphic to Euclidean circles; we therefore speak of circles and limit
circles instead of spheres and limit spheres. Through each point A not on
an s.l. g we have two (possibly coincident) asymptotes to g. They determine
two angles, an open one and a closed one. The former is determined by the
property that it contains all the s.l. through A intersecting g and no others;
the latter is the complement to the former. If the two asymptotes coincide,
the closed angle consists of exactly one s.l., which we call a parallel to g.


Let a be an asymptote to g. The question arises as to whether the family 
of the limit circles to a is (as in the hyperbolic and Euclidean geometries) 
always identical with that of the limit circles to g. We shall see that only 
part of this is true: 

(1.13) Let $\pi^*$ be that one of the two half-planes defined by a which is
contained in one of the half-planes determined by g, $\pi^{**}$ the other one. Then,
for each $P \subset g$,
$$L(T, a) \cdot \pi^* = L(P, g) \cdot \pi^* \qquad (T = L(P, g) \cdot a),$$
and $L(T, a) \cdot \pi^{**}$ is in the interior of or on L(P, g).
To prove this, we first state that for each pair of distinct points A, B and
each pair of positive numbers a, b, with $a + b > r(A, B)$, exactly two points
T and T', one on each side of AB, exist such that


(1.14) $r(A, T) = r(A, T') = a$, $r(B, T) = r(B, T') = b$.⁹


If b = a, we can say, furthermore, that TT' lies in the interior of the
quadrangle ATBT'. For, otherwise, we would have


$$r(T, A) = r(T, B), \qquad r(T', A) = r(T', B)$$
with A and B on the same side of TT'.


We now consider any point P of g. L(P, g) may intersect a in T. Let
A be any point following P on g, and B the point on a with r(B, T) = r(A, P).
Except for the point T, the circle K(B, T) lies in the interior of L(P, g),
since T is the unique foot of B on L(P, g). This statement contains the
second part of (1.13). On account of (1.14) the circles K(A, P) and
K(B, T) intersect exactly twice for sufficiently large r(A, P). Since K(A, P)


9 Compare [2].


234 HERBERT BUSEMANN. 


is (except for P) in the interior of L(P, g), one of the intersections, for large
r(A, P), must be between a and g near the arc PT of L(P, g), and the other
outside of an arbitrarily large circle around T. Hence, for large r(A, P),
an arbitrarily great portion of $K(B, T) \cdot \pi^*$ lies between K(A, P) and L(P, g),
which proves the remainder of the assertion.
It follows that:


(1.15) The asymptotes to a in $\pi^*$ are also asymptotes to g.
This can, of course, be easily seen without using (1.13).


We now show by an example that, even in a Desarguesian geometry with
the Euclidean parallel axiom, it can happen that in the above notation
$L(T, a) \cdot \pi^{**}$ lies wholly in the interior of L(P, g).

We recall the definition of a Minkowskian geometry: Let $r = A(\varphi)$ be a
convex curve in the strict sense in the polar $(r, \varphi)$ or Cartesian (x, y)-plane with
r = 0 as center. We define the distance between two points $P_1 = (x_1, y_1)$,
$P_2 = (x_2, y_2)$ to be the number


$$r(P_1, P_2) = \frac{1}{A(\varphi)} \sqrt{(x_1 - x_2)^2 + (y_1 - y_2)^2},$$
where $+\varphi$ or $-\varphi$ is the direction of the straight line $P_1P_2$, i.e.
$\tan \varphi = (y_1 - y_2)/(x_1 - x_2)$.


As shown by Minkowski, $r(P_1, P_2)$ satisfies all our conditions and the whole
Euclidean straight lines are the s.l. of our space. Now let $r = A(\varphi)$ be
the curve


2 
=—1 for y=0, 
y 
= or == VU. 


The limit circles to center rays with directions different from 0 and $\pi$ are
straight lines. The limit circle to the positive x-axis through O is the curve


for y=0, 
(*) for 
The limit circle to a center ray parallel to the positive x-axis
through the point $(\alpha, \beta)$ is obtained from (*) by a translation which carries


(0,0) into One sees: if 8 > 0 and«a— (for B < 0, a =— the 




two limit circles are identical only for y= 8B (y= 8) and the second is in 
the interior of (*) fory<B (y> 8B). 

I do not know if the concept of an asymptote is always symmetric, i. e. 
if from the fact that a is an asymptote to g always follows that g is an 
asymptote to a. A sufficient condition is that the limit spheres to a are also 
limit spheres to g, but this condition is not necessary, as we have just seen. 
Another sufficient condition is given by 


(1.16) If the distance between two non-intersecting s.l. a and b vanishes,
then a is an asymptote to b and b to a.


For, there exists a sequence of points $R_\nu$ on b, tending to infinity in a
certain direction, say $\vec{b}$, and a sequence of points $S_\nu$ on a with $r(S_\nu, R_\nu) \to 0$.
We select an arbitrary point A on a. The s.l. $AR_\nu$ tend to the asymptote c to $\vec{b}$
through A. If c is not identical with a, it must intersect $S_\nu R_\nu$ in a point $S'_\nu$
and we would have $r(S_\nu, S'_\nu) \to 0$ in contradiction to (1.9).

The converse to (1.16) cannot always hold, since it is not true in
Euclidean geometry. But one could conjecture that it is true if the parallel
axiom of hyperbolic geometry holds throughout the plane, especially in a
Desarguesian geometry in a bounded part of the Euclidean plane. To show
that this is not so, we recall the definition of a certain geometry introduced by
Hilbert.¹⁰ Let K be any bounded, closed, convex curve in the plane. The
distance between two points A, B interior to K is defined as follows: Let the ray AB
intersect K in Y and the ray BA in X. Designating the Euclidean distance by e( )
we put


$$r^*(A, B) = \log \frac{e(A, Y) \cdot e(B, X)}{e(B, Y) \cdot e(A, X)}.$$


Hilbert proves that r*(A, B) satisfies our conditions if K is convex in the
strict sense and that otherwise the straight lines are shortest lines but not
necessarily the only s.l. Therefore


r*(A, B) + e(A, B) 


is, for each bounded, convex curve K, a distance function which defines an SL
space. Choosing K as a triangle, one sees that the s.l. issuing from a vertex
of K are asymptotes to each other, but the distance between any two distinct
ones among these s.l. is positive. One should note that the distance of the
"other ends" of two such s.l. is finite.
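
The construction can be tried out numerically. The sketch below is only an illustration under stated assumptions: K is taken to be the triangle with vertices (0, 0), (1, 0), (0, 1), written as three half-planes, and the names `exit_parameter`, `hilbert` and `sl_distance` are hypothetical helpers, not notation from the paper. It computes r*(A, B) from the two points in which the straight line AB leaves K, and then the combined distance r*(A, B) + e(A, B).

```python
import math

# Assumed domain: the triangle with vertices (0,0), (1,0), (0,1),
# described by half-planes n . p <= c.
HALF_PLANES = [((-1.0, 0.0), 0.0), ((0.0, -1.0), 0.0), ((1.0, 1.0), 1.0)]


def exit_parameter(p, d):
    """Largest t >= 0 for which p + t*d is still in the closed triangle."""
    t_exit = math.inf
    for (nx, ny), c in HALF_PLANES:
        speed = nx * d[0] + ny * d[1]
        if speed > 1e-15:  # the ray moves towards this side
            slack = c - (nx * p[0] + ny * p[1])
            t_exit = min(t_exit, slack / speed)
    return t_exit


def euclid(a, b):
    return math.hypot(a[0] - b[0], a[1] - b[1])


def hilbert(a, b):
    """r*(A, B) = log( e(A,Y) e(B,X) / (e(B,Y) e(A,X)) ) for interior points."""
    if a == b:
        return 0.0
    d = (b[0] - a[0], b[1] - a[1])
    t_y = exit_parameter(a, d)                # Y = a + t_y * d, beyond b
    t_x = exit_parameter(a, (-d[0], -d[1]))   # X = a - t_x * d, beyond a
    # All four lengths are measured in units of |d|.
    return math.log((t_y * (1.0 + t_x)) / ((t_y - 1.0) * t_x))


def sl_distance(a, b):
    """The combined distance r*(A, B) + e(A, B) of the text."""
    return hilbert(a, b) + euclid(a, b)


a, c, b = (0.2, 0.2), (0.3, 0.25), (0.4, 0.3)  # c is the Euclidean midpoint
print(hilbert(a, b), hilbert(a, c) + hilbert(c, b))
```

Along a straight line the logarithm of the cross ratio is additive, which is why the straight segments remain shortest lines; adding e(A, B) makes the triangle inequality strict for points not on one straight line even though the triangle is not strictly convex.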


10 See [5], Anhang I.


2. Geometries with a group of translations along one shortest line.
Let g be an s.l. in the two dimensional SL space $\Sigma$, $\pi_1$ and $\pi_2$ the two
half-planes into which g decomposes $\Sigma$. A one-to-one transformation of $\Sigma$ into
itself, which preserves the distances of corresponding pairs of points and
transforms $\pi_1$, $\pi_2$ into themselves will be called a translation of $\Sigma$ along g.
Such a translation transforms an s.l. into an s.l.

To a given pair of points A, B on g there exists at most one translation
along g carrying A into B. For let C be any point on g different from A,
then any translation $\gamma$ along g carrying A into B transforms C into the point
$D \subset g$ with $r(C, D) = r(A, B)$ and either

$\overrightarrow{AB} \supset \overrightarrow{CD}$ or $\overrightarrow{AB} \subset \overrightarrow{CD}$.
Since (compare (1.14)) a point X of $\pi_1$ is uniquely determined by the two
distances r(X, A) and r(X, C), $\gamma$ must carry X into the unique point $X' \subset \pi_1$
with r(X', B) = r(X, A) and r(X', D) = r(X, C). This proves our statement.

A translation $\gamma$ along g, therefore, is uniquely determined by the fact that
it carries the first of a pair of points A, B on g into the second. We designate
$\gamma$ by $(A \to B)$. It follows that


$$(A \to B)(B \to C) = (A \to C) \qquad (\text{first } (A \to B))$$


and since the transformations of g into itself induced by translations along g
are commutative, the translations along g are commutative. We now assume
that to each pair of points A, B on g the translation $(A \to B)$ exists. Then
these translations form an Abelian group G. Hereafter, we indicate the
assumption of the presence of G by saying that all translations along g exist.
We first notice some simple consequences of the existence of G.


(2.1) Translations different from the identity have no fixed points. 


If P remained fixed under $(A_0 \to A_1) \neq 1$, P would also be fixed under
the positive powers $(A_0 \to A_2)$, $(A_0 \to A_3)$, $\cdots$ of $(A_0 \to A_1)$. We should have
$$r(A_0, P) = r(A_1, P) = r(A_2, P) = \cdots$$

but $r(A_0, A_\nu) = \nu \cdot r(A_0, A_1)$.


(2.2) Each point P has a unique foot on g.


Assume that $P \subset \pi_1$ has two different feet $F_1$ and $F_2$ on g. The proof
of (1.5) shows that we can choose a point Q in $\pi_1$ with a unique foot F on g
and $r(Q, g) > r(P, F_2)$. Let F' be any point between $F_1$ and $F_2$.
$(F \to F')$ transforms Q into a point Q' which has F' as unique foot on g.
Moreover Q' cannot lie in the interior of the triangle $PF_1F_2$. Hence Q'F' must
intersect $F_1P + PF_2$ in a point R, which would have two different feet on g,
contrary to page 230.

(2.3) The points of $\Sigma$ having the same point F of g as foot form an s.l.,
which we call the perpendicular to g in F.

This is always true as soon as each point of $\Sigma$ has exactly one foot on g,
i.e. the existence of G is unessential. For if $P_1 \subset \pi_1$ and $P_2 \subset \pi_2$ both have F
as foot, $P_1P_2$ must intersect g in F. If $Q \neq F$ were the intersection, we
should have

$$r(P_1, F) + r(F, P_2) > r(P_1, P_2) = r(P_1, Q) + r(Q, P_2)$$
and either
$$r(P_1, F) > r(P_1, Q) \quad \text{or} \quad r(F, P_2) > r(Q, P_2).$$


If, now, $P_3$ is any other point, say in $\pi_1$, with F as foot, then for the same
reason $P_3$ must be on $P_2F$ (= $P_1P_2$).


As an immediate consequence we have 


(2.4) $(A \to B)$, $A, B \subset g$, transforms the perpendicular p to g at A into the
perpendicular q to g at B.


The point on p in $\pi_1$ which has distance r from A is carried into the
point in $\pi_1$ on q which has distance r from B. We consider all the points
which have distance r from g. They form two curves $c_r^1$, $c_r^2$, the one in $\pi_1$,
the other in $\pi_2$. $c_r^1$ and $c_r^2$ are equidistant from g and are transformed into
themselves by each translation along g. We now prove the important fact:
(2.5) The curves $c_r^1$ and $c_r^2$ are convex; that is, a suitable one of the sides
of $c_r^1$ ($c_r^2$) (called the interior of $c_r^1$ ($c_r^2$)) together with $c_r^1$ ($c_r^2$) has the
property of containing an s.l. segment AB if it contains A and B.


We consider $c_r^1$ for some r > 0. If $c_r^1$ contains an s.l. segment it is an s.l.,
since it is transformed into itself by each motion of G.
We therefore assume that $c_r^1$ contains no s.l. segment. Let A', B' be any
points of $c_r^1$, and the arc A'B' the corresponding arc of $c_r^1$. If the segment A'B'
intersects the arc A'B' in more points than A' and B' we can find a subarc AB of
the arc A'B' such that the segment AB and the arc AB have only the points A and B
in common. Let $\pi^*$ be the half plane determined by the s.l. AB which contains
the arc AB without A and B, and D any point of the s.l. AB not on the segment AB.
The perpendicular a to g through D does not intersect the arc AB, because each
perpendicular intersects $c_r^1$ exactly once and the segment AB is met by the
perpendiculars through the points of the arc AB.
Let s be a ray of $\pi^*$ issuing from D. We vary s continuously from the
position $a \cdot \pi^*$ towards the ray DB. The arc AB being bounded, there exists a first
ray $s^0$ having common points with the arc AB, $s^0 \neq DB$. Let T be any point
common to the arc AB and $s^0$.
Using again the fact that each perpendicular to g intersects $c_r^1$ exactly once
one sees that there exists a circular disc with center T such that the (open)
half $\Delta_T$, which is on the same side of $s^0$ as $a \cdot \pi^*$, has no common points with $c_r^1$.

Let $T_1$ be any point of $c_r^1$. The translation along g carrying T into $T_1$
transforms $\Delta_T$ into a congruent semi-circular disc with center $T_1$, disjoint
from $c_r^1$ and on the same side of $c_r^1$ as $\Delta_T$. Calling this side the exterior and
the other the interior of $c_r^1$, a theorem of Tietze¹¹ shows that the interior
of $c_r^1$ plus $c_r^1$ is convex. It is true that Tietze assumes the metric to be
Euclidean, but the simple proof for Tietze's theorem given by Reinhardt¹²
can be carried over to our case without any change whatsoever.

From this proof it follows that at each point of $c_r^1$ there exists a
supporting line (this can be proved quite generally for convex curves in arbitrary
two-dimensional SL spaces). For, since $c_r^1$ contains no segment and the
s.l. $\bar{s}$ bearing $s^0$ certainly contains an exterior point of $c_r^1$, the s.l. $\bar{s}$ must be,
except for T, completely in the exterior of $c_r^1$. By translations of $\bar{s}$ along g
we get supporting lines at each point of $c_r^1$. It is also easy to see that $\bar{s}$ is
the only supporting line of $c_r^1$ at T such that $c_r^1$ has at each point a unique
tangent. But we do not need this later on. We determine now which side
of $c_r^1$ is the interior by proving


(2.6) The domain bounded by $c_r^1$ ($c_r^2$) and g is convex.


Using T and $\bar{s}$ with the same meaning as previously, at least one, say $s^*$,
of the two rays on $\bar{s}$ issuing from T does not intersect g. Let the perpendicular
to g through T cut g at Q and let $Q_1$ be a point on g on the same ("right")
side of the perpendicular as $s^*$. If $r(Q, Q_1) > 0$ is sufficiently small the
translation $(Q \to Q_1)$ transforms $\bar{s}$ into an s.l. $\bar{s}_1$ intersecting $\bar{s}$ in a point L
of $s^*$. If $T_1$ is the image of T under $(Q \to Q_1)$, the ray $LT_1$ contains the
image of $s^*$. $c_r^1$ intersects all perpendiculars to g. Therefore, if the assertion
were not true, i.e. if $\bar{s}$ and $\bar{s}_1$ were, except for T respectively $T_1$, between $c_r^1$
and g, the ray $LT_1$ would have to intersect all perpendiculars to g on the
right side of the one through L. Let $X_1$ traverse $LT_1$ and let X be the point
where the perpendicular to g through $X_1$ intersects $s^*$. Since each curve $c_\rho^1$
is cut at most twice by $\bar{s}$ and $\bar{s}_1$ and these s.l. are between $c_r^1$ and g, the
numbers $r(X_1, g)$ and $r(X, g)$ should converge to certain limits $r'_1$ and $r_1$
with $0 \le r'_1 \le r$ and $0 \le r_1 \le r$. Then we must have $r_1 = r'_1$, since $\bar{s}_1$ is the
image of $\bar{s}$ under $(Q \to Q_1)$. Hence we should have $r(X, X_1) \to 0$, which
contradicts our Lemma (1.9).


11 [12].
12 [10].

We conclude from (2.6) that an s.l. h intersecting g in a point A meets
each curve $c_r^1$ or $c_r^2$ at most once and is therefore transformed by any
translation $(A \to A')$ of G into an s.l. h' not intersecting h. For, if $h \cdot h' = S$,
$(A \to A')$ would carry S into a point S' on h'; A', S, S' would be on h' and
S and S' would be on the same curve $c_r^1$ or $c_r^2$. We see, furthermore, that
if X traverses one of the rays of h determined by A in one direction, r(X, g)
increases. Distinguishing $\overrightarrow{g}$ and $\overleftarrow{g}$ one finds: If Y traverses an asymptote a
to $\overrightarrow{g}$ in such a direction that the foot $F_Y$ of Y traverses $\overrightarrow{g}$, the distance $r(Y, F_Y)$
either decreases monotonically in the strict sense or is constant for all Y on a.
For, if $r(Y, F_Y)$ is constant on a certain segment of a this segment belongs
to a curve $c_r^i$; then a coincides with this $c_r^i$. This leads to the following
theorem:


(2.7) A curve $c_r^i$ is a shortest line if, and only if, it is parallel to g.


For, let p be a parallel to g in $\pi_1$. Our last statement shows that
r(Y, g) decreases or remains constant if Y traverses p in either direction;
hence r(Y, g) is constant and p is a curve $c_r^1$.

The converse is a little more involved. We show first: If $c_r^1$ is an s.l. and
$(A' \to B')$ transforms the point A of $c_r^1$ into the point B, then
(2.8) $r(A', B') = r(A, B)$.

Let $A_0$, $B_0$ be the feet of A, B on g. We have
$r(A_0, B_0) = r(A', B')$.


Call $B_0^n$, $B^n$ the images of $B_0$ respectively B under $(A' \to B')^{n-1}$. The points
$B, B^2, \cdots, B^n$ are on $c_r^1$. We have


$$r(A_0, B_0^n) = n \cdot r(A_0, B_0),$$
$$r(A, B^n) = n \cdot r(A, B);$$


hence


$$n \cdot | r(A_0, B_0) - r(A, B) | = | r(A_0, B_0^n) - r(A, B^n) | \le r(A, A_0) + r(B^n, B_0^n) = 2r.$$


The same argument gives the more general fact: If $(A' \to B')$ transforms A
into B, then

$$r(A, B) \ge r(A', B').$$
For, considering the images of $B_0$ and B under $(A' \to B')^{n-1}$ we find, as above,


$$n \cdot r(A, B) + 2r \ge n \cdot r(A', B').$$


We remark, furthermore, that the metric characterization (1.7) of the limit
spheres yields:

(2.9) $(A \to B)$ transforms the limit circle $L(A, \overrightarrow{g})$ into $L(B, \overrightarrow{g})$ and $L(A, \overleftarrow{g})$
into $L(B, \overleftarrow{g})$.

With the help of (2.8) and (2.9) we prove (2.7) as follows.
If $c_r^1$ were an s.l. but not an asymptote, for instance, to $\overrightarrow{g}$, the asymptote a
to $\overrightarrow{g}$ through a point P of $c_r^1$ would be different from $c_r^1$. The limit circle
to $\overrightarrow{g}$ through P may cut g in $P_0$. Let $(P_0 \to Q_0)$ be any translation in the
direction $\overrightarrow{g}$. $(P_0 \to Q_0)$ transforms $L(P_0, \overrightarrow{g})$ into $L(Q_0, \overrightarrow{g})$. Designating the
intersections of $L(Q_0, \overrightarrow{g})$ with $c_r^1$ and a by Q and Q', we would have
$r(P, Q') = r(P_0, Q_0)$ on account of (1.8) and $r(P, Q) = r(P_0, Q_0)$ as a
consequence of (2.8), in contradiction to (1.10).

We see that a translation $(A' \to B')$ along g is also a translation $(A \to B)$
along $c_{r_1}^1$, with $r(A', B') = r(A, B)$, if $c_{r_1}^1$ is an s.l. and $A \subset c_{r_1}^1$. (2.6) shows
that then the curves $c_r^1$ with $0 < r < r_1$ are also s.l. Putting our results
together we find:


(2.10) If $c_{r_1}^1$, $r_1 > 0$, is an s.l., then the curves $c_r^1$ with $0 \le r \le r_1$ are also
s.l.; all these $c_r^1$ are parallel to each other; and a translation $(A \to B)$ along
g is at the same time a translation $(A' \to B')$ along any of the $c_r^1$ with
$r(A, B) = r(A', B')$. An s.l. h intersecting g intersects all $c_r^1$ for $0 \le r \le r_1$.
If h' is the transform of h under $(A \to B)$, $A, B \subset g$, the s.l. h and h' cut out
segments of the same length r(A, B) on all these $c_r^1$ and two different $c_r^1$ cut
out segments of equal length on h and h'.


This could lead to the conjecture that if through each point of the plane
a parallel to g exists (we say in this case: the parallel axiom holds with
respect to g), the parallel axiom must hold with respect to each s.l. But we
show by an example that this is not necessarily so, not even in Desarguesian
geometry. We introduce a metric in the part |y| < 1 of the (x, y)-plane
by putting

$$r((x_1, y_1), (x_2, y_2)) = |x_1 - x_2| + \left| \log \frac{(y_1 - 1)(y_2 + 1)}{(y_1 + 1)(y_2 - 1)} \right| + \sqrt{(x_1 - x_2)^2 + (y_1 - y_2)^2}.$$


One sees easily that r( ) defines an SL space and that the parts of the
Euclidean straight lines in |y| < 1 are the s.l. of the metric, which therefore
is Desarguesian. Furthermore, the parallel axiom holds with respect to the
x-axis and all translations along the x-axis exist. We remark that the s.l.
x = constant, |y| < 1, are equidistant from each other. The circles of this
metric are in general not convex. A result of P. Funk¹³ shows that it is
impossible to introduce in |y| < 1 a (Desarguesian) metric with these s.l.
for which the equidistant curves to each straight line are again straight lines.
Section 4 of this paper implies that one cannot find both a metric with these
s.l. and translations along two s.l. which are neither parallel nor asymptotes
to each other.
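
The metric of this example is simple enough to check by direct computation. The following sketch is offered only as an illustration; the function name `strip_metric` and the sample points are assumptions of the example. It evaluates the distance in the strip |y| < 1 and checks three of the properties claimed in the text: additivity along a Euclidean straight segment, invariance under translations along the x-axis, and the fact that the distance between points at the same height on two lines x = constant does not depend on that height.

```python
import math


def strip_metric(p, q):
    """Distance in the strip |y| < 1 as defined in the text."""
    (x1, y1), (x2, y2) = p, q
    # Both numerator and denominator are products of one negative and one
    # positive factor, so the quotient is positive for |y1|, |y2| < 1.
    log_term = math.log(((y1 - 1.0) * (y2 + 1.0)) / ((y1 + 1.0) * (y2 - 1.0)))
    return abs(x1 - x2) + abs(log_term) + math.hypot(x1 - x2, y1 - y2)


p, m, q = (0.0, -0.5), (0.5, 0.0), (1.0, 0.5)   # m is the Euclidean midpoint

# Additivity along a straight segment: the segment is a shortest line.
print(strip_metric(p, m) + strip_metric(m, q), strip_metric(p, q))

# Invariance under the translation x -> x + 2.5 along the x-axis.
shift = lambda pt: (pt[0] + 2.5, pt[1])
print(strip_metric(p, q), strip_metric(shift(p), shift(q)))

# Points at equal height on x = 0 and x = 1 are at the same distance.
print(strip_metric((0.0, 0.3), (1.0, 0.3)), strip_metric((0.0, -0.8), (1.0, -0.8)))
```

Each of the three summands is additive along straight lines, because x and y vary monotonically along them, and the Euclidean summand makes the triangle inequality strict off the line.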

We have seen that the existence of G, the Desarguesian character of the
metric, and the parallel axiom with respect to g do not imply the validity
of the parallel axiom with respect to each s.l. On the other hand, we are
going to show by another example that the existence of G and the validity
of the parallel axiom for each s.l. do not imply the Desarguesian character
of the metric.

For, let $\Sigma$ be the whole Euclidean plane and let g be the x-axis. We
introduce a Minkowskian metric $r_1(P, Q)$ in $\pi_1 + g$ ($y \ge 0$) and a different
one, $r_2(P, Q)$, in $\pi_2 + g$ ($y \le 0$) but so that the diameters parallel to g of the
unit circles both have the Euclidean length 1. Then one has (e( ) is the
Euclidean distance) $e(P, Q) = r_1(P, Q) = r_2(P, Q)$ for $P, Q \subset g$. Let
$P_1 = (x_1, y_1)$, $y_1 > 0$ be any point of $\pi_1$ and $P_2 = (x_2, y_2)$, $y_2 < 0$ any point of
$\pi_2$. Then the functions $r_1(P_1, (x, 0))$ and $r_2(P_2, (x, 0))$ are convex; hence
their sum is convex and there exists a uniquely determined point $X_0 \subset g$,
such that

$$r(P_1, P_2) = \min_x \, [r_1(P_1, (x, 0)) + r_2(P_2, (x, 0))] = r_1(P_1, X_0) + r_2(P_2, X_0).$$

Putting


$r(P, Q) = r_1(P, Q)$ if $P, Q \subset \pi_1 + g$


$r(P, Q) = r_2(P, Q)$ if $P, Q \subset \pi_2 + g$


13 In [4]. 


242 HERBERT BUSEMANN. 


defines r(P, Q) for all points of the plane. One sees easily that r(P, Q)
satisfies all our conditions. The s.l. are the straight lines y = const. and
straight lines which are broken at a point of the x-axis. Parallel rays in
$\pi_1 + g$ have parallel continuations in $\pi_2 + g$; therefore the parallel axiom
holds for each s.l. All translations along each line y = const. exist. But the
theorem of Desargues is not true, if the circles of the two Minkowskian
geometries have no special relations to each other, for instance, if the circles in $\pi_1$
are ellipses and those in $\pi_2$ are not.
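
The gluing of the two Minkowskian metrics can be sketched numerically as follows. Everything specific in this sketch is an assumption chosen for illustration: the upper norm has an ellipse as unit circle, the lower norm has the curve |dx|^4 + |dy|^4 = 1 as unit circle, both reduce to |dx| on the x-axis so that they agree with each other on g, and the minimum over the crossing point is found by a plain ternary search. The names `norm_upper`, `norm_lower` and `glued_distance` are hypothetical.

```python
import math


def norm_upper(dx, dy):
    """Assumed norm for y >= 0; its unit circle is the ellipse dx^2 + 4 dy^2 = 1."""
    return math.sqrt(dx * dx + 4.0 * dy * dy)


def norm_lower(dx, dy):
    """Assumed norm for y <= 0; its unit circle |dx|^4 + |dy|^4 = 1 is not an ellipse."""
    return (abs(dx) ** 4 + abs(dy) ** 4) ** 0.25


def glued_distance(p, q, iterations=200):
    """r(P, Q): one norm inside each closed half-plane, a minimum across the axis."""
    (x1, y1), (x2, y2) = p, q
    if y1 >= 0 and y2 >= 0:
        return norm_upper(x1 - x2, y1 - y2)
    if y1 <= 0 and y2 <= 0:
        return norm_lower(x1 - x2, y1 - y2)
    if y1 < 0:  # arrange that (x1, y1) is the point of the upper half-plane
        (x1, y1), (x2, y2) = (x2, y2), (x1, y1)

    def through(x):
        # Length of the broken line P1 -> (x, 0) -> P2.
        return norm_upper(x1 - x, y1) + norm_lower(x2 - x, y2)

    # Both norms are even in dx, so the convex sum attains its minimum
    # between x1 and x2; ternary search is enough for a convex function.
    lo, hi = min(x1, x2), max(x1, x2)
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if through(m1) <= through(m2):
            hi = m2
        else:
            lo = m1
    return through(0.5 * (lo + hi))


print(glued_distance((0.0, 1.0), (0.0, -1.0)))
print(glued_distance((-1.0, 1.0), (2.0, -1.0)))
```

The point at which the minimum is attained plays the role of $X_0$ in the text: the shortest line joining the two points is the straight line broken there.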

Returning to the general case, suppose that there exists a last $r \ge 0$, say
$r_0$, such that $c_{r_0}^1$ is an s.l. $g_1$, and let $\pi^*$ be that half plane determined by $g_1$
which is contained in $\pi_1$. Then through each point of $\pi^*$ there pass two
different asymptotes to $g_1$. Let a be an asymptote to $g_1$. From (2.9) it follows
that the images of a under the translations of G are again asymptotes to $g_1$,
and (2.6) shows that a translation $\gamma$ in the direction $\overrightarrow{g}$ carries a into an s.l.
between a and $g_1$. Hence the positive powers of $\gamma$ must carry a into a sequence
of asymptotes to $g_1$, which, according to (1.11), converges to an asymptote $\bar{a}$
to $g_1$. $\bar{a}$ must remain invariant under all translations of G; therefore $\bar{a}$ is a
curve $c_r^1$; and since $c_{r_0}^1 = g_1$ is the last curve $c_r^1$ which is an s.l., we must
have $\bar{a} = g_1$. This implies that each asymptote a to $g_1$ in $\pi^*$ has distance 0
from $g_1$, and that $r(x, g_1) \to \infty$ if x traverses a in the direction $\overleftarrow{a}$. From
(1.15) and (1.16) we have that a is also an asymptote to $g_1$ and to each other
asymptote to $g_1$ in $\pi^*$. Let h be any s.l. in $\pi^*$ which is no asymptote to $g_1$.
Orienting h in the "same" way as $g_1$, one sees easily that there exists an s.l. $a_1$
between h and $g_1$ which is an asymptote to $\overrightarrow{h}$ and $\overleftarrow{g_1}$, and an s.l. $a_2$, which is an
asymptote to $\overleftarrow{h}$ and $\overrightarrow{g_1}$. Hence, if y tends to infinity on h in either direction,
one has $r(y, g_1) \to \infty$. There exists exactly one pair of points $S \subset h$ and
$R \subset g_1$ such that $r(R, S) = r(h, g_1)$. S is determined as the point where a
curve $c_r^1$ touches h and R is the foot of S on $g_1$, and inversely.

The first question which arises in this connection is if it is actually possible
that $g_1 \neq g$, or $0 < r_0 < \infty$. In a Desarguesian geometry it certainly cannot
happen. For, if one of two Euclidean straight line segments (unbounded ones 
admitted) is parallel to another in our sense, both of them must be whole 
straight lines. The only convex domains in the Euclidean plane which con- 
tain a whole straight line are the whole plane, a half-plane, and the part of 




the plane between two parallel lines. But there are non-Desarguesian
geometries with $0 < r_0 < \infty$.

In order to construct such a metric, let $\zeta$ be the interior of the unit circle
of the (x, y)-plane. Introduce in $\zeta$ a hyperbolic metric h(P, Q) in such a
way that the straight-line segments in $\zeta$ become the s.l. Let $\zeta_1$ be the part
$y \ge 0$, $\zeta_2$ the part $y \le 0$ of $\zeta$ and $\pi^*$ the half-plane $y \ge 0$. We map $\pi^*$
topologically on $\zeta_1$ by associating O = (0, 0) with itself and a point $Q \neq O$
to that point $\bar{Q}$ on OQ for which


$e(O, Q) = h(O, \bar{Q})$ (e( ) is the Euclidean distance).


If $\bar{P}$, $\bar{Q}$ are any points in $\zeta_1$ and P, Q the points in $\pi^*$ to which they
correspond, we put
$$r(\bar{P}, \bar{Q}) = e(P, Q).$$


In this way we have introduced in $\zeta_1$ a Euclidean metric (for which the s.l.
are in general no Euclidean straight line segments) which on $-1 < x < 1$,
y = 0 coincides with the hyperbolic metric. For two points in $\zeta_2$ we put


r(P, Q) = h(P, Q).


By applying the same method which led to the combination of the two
different Minkowskian geometries, we define r(P, Q) also for $P \subset \zeta_1$, $Q \subset \zeta_2$.
We thus get a metric in $\zeta$, which makes $\zeta$ an SL space, with translations along
the s.l. $g_1$: y = 0. Taking any parallel $g \neq g_1$ to $g_1$ in $\zeta_1$ one sees easily that
these s.l. g and $g_1$ satisfy the assumptions of our previous considerations.
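
The radial map used in this construction can be made concrete under one extra assumption that the text does not fix: the hyperbolic metric is normalized to curvature -1 in the projective (Klein) model of the unit disc, so that h(O, Q̄) is the inverse hyperbolic tangent of the Euclidean norm of Q̄. With that normalization, which is an assumption of this sketch, as are the names `klein_distance`, `to_half_plane` and `upper_metric`, the distance in the upper half of the disc is obtained by pulling the points back to the half-plane and measuring there with e( ).

```python
import math


def klein_distance(p, q):
    """Hyperbolic distance in the Klein disc model (assumed curvature -1)."""
    dot = p[0] * q[0] + p[1] * q[1]
    denom = math.sqrt((1.0 - p[0] ** 2 - p[1] ** 2) * (1.0 - q[0] ** 2 - q[1] ** 2))
    return math.acosh(max(1.0, (1.0 - dot) / denom))


def to_half_plane(p_bar):
    """Inverse of the radial map: the point Q on the ray O p_bar with e(O, Q) = h(O, p_bar)."""
    rho = math.hypot(p_bar[0], p_bar[1])
    if rho == 0.0:
        return (0.0, 0.0)
    scale = math.atanh(rho) / rho
    return (scale * p_bar[0], scale * p_bar[1])


def upper_metric(p_bar, q_bar):
    """Distance of two points of the upper half of the disc: e(P, Q) of their preimages."""
    p, q = to_half_plane(p_bar), to_half_plane(q_bar)
    return math.hypot(p[0] - q[0], p[1] - q[1])


# On the diameter y = 0 the two metrics agree ...
print(upper_metric((-0.3, 0.0), (0.6, 0.0)), klein_distance((-0.3, 0.0), (0.6, 0.0)))
# ... but not for general pairs of points with y > 0.
print(upper_metric((0.5, 0.5), (-0.5, 0.5)), klein_distance((0.5, 0.5), (-0.5, 0.5)))
```

The agreement on the diameter is what allows the Euclidean half to be joined to the hyperbolic half along y = 0 by the same minimum construction as in the Minkowskian example.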


3. Geometries with translations along an s.l. g and its asymptotes.
The last example also shows that the existence of all translations along $g_1$
(or g) does not imply the existence of translations along an asymptote to $g_1$
(or g) which is not a parallel.

Our considerations of the general case (compare p. 242) did indicate
that by translations along $g_1$ (or g) any asymptote to $g_1$ in $\pi^*$ can be
carried into any other one. If there exist also translations along a, we conclude
from this that there exist two different asymptotes to $g_1$ through each
point $P \not\subset g_1$. For, if $g \neq g_1$, g would be an asymptote to a, since g is an
asymptote to $g_1$ and $g_1$ to a (compare (1.15)). Then $r(a, g) = 0$, but
$r(a, g) \ge r(g_1, g) > 0$. From $r(a, b) = 0$ for each asymptote b to a and
$r(g, c) = 0$ for each asymptote c to $g = g_1$, one sees that the asymptotes to g
are asymptotes to each other; and, since by a translation along a any
asymptote to g in $\pi_2$ ($\pi_1$) can be transformed into g there exist translations


along each asymptote to g. We have found 


(3.1) If a is an asymptote to g not parallel to g and if all translations exist
along g as well as along a, then all asymptotes to g are asymptotes to each
other and all translations along each of these s.l. exist. If b is any of them,
then the two asymptotes to b through any point P not on b are different.


We shall now study in a little more detail the properties of a metric 
satisfying the assumption of (3.1). Let a, and zz again be the two half 
planes determined by g. At first all considerations will refer only to one of 


these half planes, say 7,. We therefore put P+ L(P,g)m=—L,(P,g). We 
again designate by c,' the curve equidistant from g in m with r(c;',g) =1. 


(2.9) shows that the arcs of the different limit circles $L_1(P, g)$ between any
two fixed curves $c_{r_1}^1$ and $c_{r_2}^1$ are congruent. The availability of the
translations along the asymptotes to g allows a considerable strengthening of this
statement. Each arc AB on $L_1(P, g)$ is congruent to an arc on any $L_1(Q, g)$
starting at g and therefore also to an arc starting at an arbitrary point. (We
say the arc AB starts at A if A is on $L_1(P, g)$ between B and P.) To prove
this we draw the asymptote a to g through B and the curve through A
equidistant from a. Since r(a, g) = 0, this curve intersects g at a point A'. We
draw L(A', g) and put $a \cdot L(A', g) = B'$. The curve $c_r^1$ through B' may
intersect $L_1(Q, g)$ at Q'. We say that QQ' is congruent to AB (and write
QQ' = AB). Since (Q → A') transforms QQ' into A'B', we have QQ' = A'B'.
The translation (B → B') along a transforms L(P, g) into the limit circle
through B' to the image g' of g under (B → B'). g' is an asymptote to g in
$\pi_2$. The limit circles to g' and g coincide in $\pi_1$ on account of (1.13), hence
QQ' = A'B' = AB. It follows from this that on each arc AB of $L_1(P, g)$


there exists a point C (the center of AB) with AC = CB and, more generally,
points $C_1, \ldots, C_{n-1}$ with

$$AC_1 = C_1C_2 = \cdots = C_{n-1}B.$$

We then write

$$AB = 2AC = 2CB, \qquad AC = \tfrac{1}{2}AB, \qquad AC_m = \tfrac{m}{n}AB, \quad \text{and so forth},$$

and extend this notation to arbitrary non-negative real numbers by a limit
process. A consideration similar to the proof of (2.5) shows that the limit
circles $L_1(P, g)$ are convex and therefore rectifiable, and that congruent pieces
have the same length. If one does not wish to use this fact, he may associate
the number 1 to an arbitrary arc $A_0B_0$ of any $L_1(P, g)$ and real
numbers to other arcs according to our above prescription. We use AB both
for the arc and its length.

Fig. 1.


We intend to determine the length of an arc XY, where X traverses g
and Y a fixed asymptote a to g, as a function of the distances on g (see Fig. 1).

Let ST, $S \subset g$, $T \subset a$, be any such arc; draw the curve through T equidistant
from g and the one through S equidistant from a. They may intersect in $R_1$.
Let the limit circle through $R_1$ to g intersect g in $S_1$ and a in $T_1$. Then, as
before, $S_1R_1 = R_1T_1$. The asymptote to g through $R_1$ intersects ST in its
center R. If the intersection were different from R, say $\bar R \subset SR - R$, then
$(S_1 \to S)$ would transform $S_1R_1$ into ST, $R_1\bar R$ into a, and hence $R_1R$ into a
ray TR', where R' would be on the limit circle L(S', g) through the image S'
of S under $(S_1 \to S)$. Put $L(S', g) \cdot a = T'$. Then $(T_1 \to T)$ transforms $T_1R_1$
into TS, $R_1\bar R$ into g, TR into T'S', and $R_1R$ into a ray SR'', where R'' is an
interior point of T'S'. But then we should have

$$T'R'' = TR = RS = R'S',$$

whereas T'R'' is a proper subarc of R'S'.

Since $S_1R_1 = 2RS$, we see that the s. l. connecting the centers of $S_1R_1$ and
SR is an asymptote to g and, since the arcs $R_1T_1$ and RT also belong to limit
circles to $R_1R$, we see that the s. l. connecting the centers of $R_1T_1$ and RT
is also an asymptote to g. We thus find:


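Let the points $B \subset S_1T_1$ and $A \subset ST$ be chosen in such manner that

$$SA = \lambda\, ST, \qquad S_1B = \lambda\, S_1T_1 \;(= 2\lambda\, ST), \qquad 0 < \lambda < 1.$$

Then AB is an asymptote to g.

Furthermore our considerations show that

$$S'T' = \tfrac{1}{2}ST = \tfrac{1}{4}S_1T_1, \qquad r(S', S) = r(S, S_1).$$

More generally, if $S_n$ and $S^n$ are chosen in such manner that

$$S_n \subset S'S \ \text{ and } \ r(S, S_n) = n\,r(S', S) = n\,r(S, S_1),$$
$$S^n \subset SS' \ \text{ and } \ r(S, S^n) = n\,r(S', S),$$

and if one puts $L(S^n, g) \cdot a = T^n$, $L(S_n, g) \cdot a = T_n$, one has

$$S^nT^n = 2^{-n}\,ST, \qquad S_nT_n = 2^{n}\,ST.$$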


Now choose any point H between S' and S and put $a \cdot L(H, g) = K$. Let $H_1$
be the point on g for which, with $L(H_1, g) \cdot a = K_1$, one has $H_1K_1 = 2HK$.
We assert that

$$r(H, H_1) = r(S, S_1).$$

We consider the points $H_n$ on $HH_1$ with $r(H_n, H) = n\,r(H_1, H)$ and put
$a \cdot L(H_n, g) = K_n$. If, for instance, one had $r(H, H_1) > r(S, S_1)$, we should
have

$$r(S, S_\nu) < r(S, H_\nu)$$

for large ν; hence, with an obvious signification of the sign >,

$$2^\nu HK = H_\nu K_\nu > 2^\nu ST,$$

in contradiction to the fact that ST > HK.

Up to this point we have found:

(3.2) There exists a number σ > 0 such that, for an arbitrary asymptote b
to g in $\pi_1$ and for an arbitrary pair of points A < B on g with r(A, B) = σ,
the arc of L(A, g) between b and g is congruent to the half of the arc of
L(B, g) between g and b.

We now prove:

(3.3) The points Z of all arcs XY of L(X, g) with $X \subset g$, $Y \subset a$ and
YZ = λXY, 0 < λ < 1, form an asymptote to g.

As in a previous proof, it is sufficient to show this for λ = ½. Using the
same notations as before and calling L, $L_1$ the centers of HK, respectively
$H_1K_1$, we conclude from $r(H, H_1) = r(S, S_1) = \sigma$ that $(H \to H_1) = (S \to S_1)$.
But $(H \to H_1)$ transforms a into the asymptote to g through that point M of
$H_1K_1$ for which $H_1M = HK$; hence $M = L_1$. $(S \to S_1)$ transforms a into
$R_1R$; hence $R_1R = L_1L$. This proves the assertion.


Herewith the length of XY can be determined as follows: Let P be the
center of $SS_1$ and put $L(P, g) \cdot a = Q$. $(S \to P)$ transforms ST into a subarc
PE of PQ. Let the asymptote to g through E intersect $S_1T_1$ in D. Then
$PQ = S_1D$ since $(S \to P)$ transforms a into ED, and, on account of (3.3),

$$\frac{PQ}{ST} = \frac{S_1T_1}{PQ} = \frac{2\,ST}{PQ},$$

or

$$PQ = \sqrt{2}\,ST.$$
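The step from (3.3) to the value of PQ can be written out as follows. This is only a sketch of one reading of the argument: it assumes that (3.3) is used in the form "the asymptote ED cuts the arcs PQ and $S_1T_1$ in the same ratio", and it uses PE = ST, $S_1D = PQ$ and $S_1T_1 = 2\,ST$ from the text above.

```latex
% Requires amsmath. Symbols as in the text; the ratio form of (3.3) is an assumption.
\begin{align*}
\frac{PE}{PQ} &= \frac{S_1D}{S_1T_1}
  && \text{(3.3): the asymptote $ED$ divides both arcs in the same ratio}\\[2pt]
\frac{ST}{PQ} &= \frac{PQ}{2\,ST}
  && \text{since } PE = ST,\quad S_1D = PQ,\quad S_1T_1 = 2\,ST\\[2pt]
PQ^{2} &= 2\,ST^{2}
  \quad\Longrightarrow\quad PQ = \sqrt{2}\,ST .
\end{align*}
```

In words: moving the foot of the arc through half of the distance $r(S, S_1)$ multiplies the arc length by $\sqrt{2}$, just as moving it through the whole distance doubles it.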


Putting

$$x = -r(S, X) \ \text{ for } X \subset SS', \qquad x = r(S, X) \ \text{ for } X \subset SS_1,$$

we find in this way

$$(3.4)\qquad XY = ST \cdot e^{(\log 2/\sigma)\,x},$$

which (except for the normalization σ = log 2) is the same formula as in
hyperbolic geometry.¹⁴
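One way to see how the exponential law (3.4) arises from the doubling and midpoint statements is sketched below. The function f is a hypothetical shorthand, not notation of the paper: f(x) denotes the length of the arc XY whose foot X on g has signed distance x from S. The sketch assumes that the midpoint argument may be repeated on each halved interval and that f is monotone, so that the dyadic values determine f everywhere.

```latex
% Requires amsmath. f(x) is a hypothetical shorthand for the length of the arc XY
% whose foot X has signed distance x from S; sigma is the constant of (3.2).
\begin{align*}
f(0) &= ST, \\
f(x+\sigma) &= 2\,f(x)
  && \text{by (3.2)},\\
f\!\left(x+\tfrac{\sigma}{2}\right) &= \sqrt{2}\,f(x)
  && \text{by the midpoint argument } (PQ=\sqrt{2}\,ST),\\
f\!\left(\tfrac{m}{2^{k}}\,\sigma\right) &= 2^{\,m/2^{k}}\,ST
  && \text{by repeating the midpoint argument ($m$ an integer, $k \ge 0$)},\\
f(x) &= ST\cdot 2^{\,x/\sigma} = ST\,e^{(\log 2/\sigma)\,x}
  && \text{for all real $x$, assuming $f$ is monotone.}
\end{align*}
% With the normalization sigma = log 2 the last line reads f(x) = ST e^{x}.
```

With σ = log 2 the last line reduces to $f(x) = ST\,e^{x}$, which is the sense in which the formula agrees with the hyperbolic one up to normalization.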

We have derived (3.4) using only a part of the congruence axioms. We 
show, indeed, by an example that a geometry with all translations along two 
non-parallel asymptotes is not necessarily hyperbolic, not even Desarguesian. 

We again introduce in the domain $x^2 + y^2 < 1$ of the (x, y)-plane the
hyperbolic metric h(P, Q) of —. Designate by h the broken line consisting
of the two hyperbolic rays issuing from (0, 0) and ending at (0, −1) respectively
$(\sqrt{1/2}, \sqrt{1/2})$. We define distances for the points P', Q' on h as follows:
Call g the hyperbolic s. l. y = 0, |x| < 1, and let $c_r^1$, $c_r^2$ be the hyperbolic
curves equidistant from g in y < 0 respectively y > 0. Each of these curves
$c_r^1$, $c_r^2$ intersects h exactly once. We draw the curves passing through P' and
Q'. If both of them are curves $c_r^1$ ($c_r^2$), say $c_{r_1}^1$, $c_{r_2}^1$ ($c_{r_1}^2$, $c_{r_2}^2$), we put
$r(P', Q') = |r_1 - r_2|$. If one is the curve $c_{r_1}^1$, the other $c_{r_2}^2$, we put
$r(P', Q') = r_1 + r_2$. Call G the smallest group of hyperbolic motions
containing the translations along the s. l. ending at (−1, 0). G can be generated
by the translations along g and the hyperbolic rotations around the point
(−1, 0). By a suitable transformation of G any pair of points P, Q, such
that PQ (by XY, respectively $\overline{XY}$, we designate during the discussion of this
example the Euclidean straight line, respectively segment, connecting X and
Y) does not pass through (−1, 0), can be transformed into exactly one pair
of points P', Q' on h. We put

(3.5a)  r(P, Q) = r(P', Q').

For pairs of points P, Q, where PQ passes through (−1, 0) we put

(3.5b)  r(P, Q) = h(P, Q).
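The two-step definition in (3.5a) and (3.5b) can be collected in a single display. This is a restatement under assumptions, not a formula from the paper: γ is a hypothetical name for the transformation of G that carries the pair P, Q onto the broken line h, and $r_1$, $r_2$ denote the hyperbolic distances from g of the equidistant curves through P' = γP and Q' = γQ.

```latex
% Requires amsmath. \gamma \in G is a hypothetical name for the motion that carries
% P, Q onto the broken line h; r_1, r_2 are the distances from g of the
% equidistant curves through P' = \gamma P and Q' = \gamma Q.
\[
r(P,Q)=
\begin{cases}
r(\gamma P,\gamma Q), & \text{if the Euclidean line } PQ \text{ does not pass through } (-1,0),\\[2pt]
h(P,Q), & \text{if the Euclidean line } PQ \text{ passes through } (-1,0),
\end{cases}
\]
\[
r(P',Q')=
\begin{cases}
|r_1-r_2|, & \text{if } P',\,Q' \text{ lie on equidistant curves on the same side of } g,\\[2pt]
r_1+r_2, & \text{if they lie on equidistant curves on opposite sides of } g.
\end{cases}
\]
```

Read this way, the new metric measures distance along h by the distance from g, and every other pair of points is first moved onto h by a motion of G.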


It is easy, but a little tedious, to confirm that the function r(P, Q) defines
an SL space. The s. l. of this metric are h and its transforms under G and
the hyperbolic s. l. issuing from (−1, 0). Evidently r(P, Q) remains
invariant under all transformations of G. The theorem of Desargues does
not hold.

¹⁴ See for instance [6], p. 55.

To find a configuration confirming this statement (comp. Fig. 2), we take
the Euclidean ellipse e with center at (−½, 0) having third order contact
with the unit circle at (−1, 0). e is a limit circle in both our and the
hyperbolic metric.¹⁵ Take a point A on e in y > 0 such that the hyperbolic tangent
to e at A intersects g at a point $B_{10}$. A divides this tangent into two rays of
which the one containing $B_{10}$ is a part of an s. l., $h_1$, of our metric. Choose $g_2$
issuing from (−1, 0) such that it intersects $h_1$ in a point $B_{12}$. Take now two
hyperbolic rays issuing from A and enclosing a sufficiently small angle ε which
contains $h_1$ in its interior. Let these rays intersect $g_i$ in $B_{0i}$ and $B_{2i}$ (i = 0, 2;
$g_0 = g$), where $B_{2i}$ is to be understood to be between (−1, 0) and $B_{0i}$. Then
the hyperbolic rays $AB_{00}$ and $B_{20}B_{22}$ are also rays of our metric. Let $h_0$ and
$h_2$ be the s. l. of our metric containing $AB_{00}$ and $B_{20}B_{22}$, respectively. Then
$h_0$ and $h_1$ pass through A but $h_2$ does not. We choose D and E on $h_1$
($h_1 \cdot g_i = B_{1i}$) such that D is between E and $B_{12}$. If ε is sufficiently small, the
hyperbolic segments $B_{00}D$, $B_{20}D$, $B_{02}E$ and $B_{22}E$ are also segments of our
metric. The points $B_{00}D \cdot B_{02}E$ and $B_{20}D \cdot B_{22}E$ together with (−1, 0) lie on
a hyperbolic s. l. $g_1$, which is also an s. l. of our metric. Hence corresponding
sides of the triangles $B_{00}DB_{20}$ and $B_{02}EB_{22}$ intersect on $g_1$, but the s. l. $h_0$, $h_1$,
and $h_2$ through corresponding vertices of these triangles do not pass through
one point.

Fig. 2. (Labeled points: $(\sqrt{1/2}, \sqrt{1/2})$, (−1, 0), (0, −1).)

¹⁵ See [13], p. 356.


4. Geometries with translations along two s. l. which are not asymptotes
to each other. We now assume that all translations along two s. l. g
and h exist, where h is not an asymptote to g (and hence g not to h, compare
(3.1)). If h and g do not intersect, the curve which touches h and is
equidistant from g intersects the curve which touches g and is equidistant from h
in exactly two points. (This follows from (2.6), (2.7) and page 242.) If
A is one of these points, a suitable translation along g puts h into a position
so that it passes through A and a translation along h does the same for g.
These two s. l. through A are different and all translations along them exist.
We see that we can restrict ourselves to the case where h and g have a common
point O. We are going to prove the following theorem.


(4.1) If all translations along g and h exist, if h is not an asymptote to g,
and if there is either a parallel g' ≠ g to g or a parallel h' ≠ h to h, then the
metric is Minkowskian.


Let a parallel g' ≠ g exist. We assume that h intersects g in a point O;
then it must also intersect g' in a point O'. (O → O') transforms g into an s. l.
not intersecting g (comp. p. 239) through O', hence into g'; and transforms g'
into an s. l. g'' intersecting h in the image O'' of O' under (O → O'). Since g'
is equidistant from g, g'' must be equidistant from g'; hence parallel to g'. By
continuing this and by considering also the images of g' under (O' → O),
(O' → O)², ..., we find, with the help of (2.7) and (2.10), that the parallel
axiom holds with respect to g, and that these parallels are equidistant from
each other. Furthermore, the images of h under any two translations along g
cut out equal segments of all these parallels to g, and any two parallels to g
cut out equal segments of all the images of h. If we knew that the parallel
axiom holds with respect to each s. l. in Σ we could derive our theorems from
these statements in a few lines. The length of the following proof is due to
the fact that we prove the parallel axiom and the theorem of Desargues
simultaneously.

We introduce coordinates in our plane Σ. We distinguish the two rays
$g^+$ and $g^-$ of g issuing from O and the two rays $h^+$ and $h^-$ of h issuing from O.
Through each point P of Σ there passes exactly one image $h_x$ of h under a
translation along g and exactly one parallel $g_y$ to g. Put $h \cdot g_y = Y$ and
$g \cdot h_x = X$. We have, as stated before,

$$r(X, O) = r(P, Y), \qquad r(Y, O) = r(P, X).$$

As coordinates x, y of P we take

$$x = r(X, O) \ \text{ if } X \subset g^+, \qquad x = -r(X, O) \ \text{ if } X \subset g^-,$$
$$y = r(Y, O) \ \text{ if } Y \subset h^+, \qquad y = -r(Y, O) \ \text{ if } Y \subset h^-.$$

We map Σ onto a Cartesian (x, y)-plane $\bar\Sigma$ by making points with the same
coordinates correspond. To a motion of Σ composed of translations along g
and h corresponds a translation of $\bar\Sigma$ and, since each translation in $\bar\Sigma$ can be
composed of translations along the x- and the y-axes, there corresponds a
motion of Σ to each translation of $\bar\Sigma$.
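The correspondence between motions of the metric plane and Euclidean translations can be made explicit as below. The names are hypothetical and introduced only for this sketch: Σ is the metric plane and $\bar\Sigma$ its Cartesian image, φ is the coordinate map, $\tau_g^{a}$ is the translation along g through the signed distance a (positive towards $g^+$), and $\tau_h^{b}$ the translation along h through b (positive towards $h^+$).

```latex
% Requires amsmath. Hypothetical notation: \varphi is the coordinate map,
% \tau_g^{a} the translation of \Sigma along g through the signed distance a,
% \tau_h^{b} the translation of \Sigma along h through the signed distance b.
\begin{align*}
\varphi &: \Sigma \to \bar\Sigma, \qquad \varphi(P) = (x,\,y),\\
\varphi\bigl(\tau_g^{a}(P)\bigr) &= (x + a,\; y)
  && \text{parallels to $g$ are kept, } h_x \mapsto h_{x+a},\\
\varphi\bigl(\tau_h^{b}(P)\bigr) &= (x,\; y + b)
  && \text{images of $h$ are kept, } g_y \mapsto g_{y+b},\\
\varphi\bigl(\tau_h^{b}\,\tau_g^{a}(P)\bigr) &= (x + a,\; y + b).
\end{align*}
```

The last line is the statement used in the sequel: every translation of the Cartesian plane by a vector (a, b) is the image of a motion of the metric plane.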

We wish to show that each s. l. in Σ is mapped onto a straight line in $\bar\Sigma$.

We first remark that, if any s. l. k in Σ is mapped onto a straight line $\bar k$ in $\bar\Sigma$, then
all translations along k exist because all translations of $\bar\Sigma$ along $\bar k$ exist. Let
k' be any transform of k by a translation along g. Since k and k' cut out equal
segments of g and its parallels and a translation along k' transforms a parallel
to g into a parallel to g, we see that under the translation along k the s. l. k'
is transformed into itself. It is therefore equidistant from k and parallel to k.
We have

(4.2) If any s. l. k in Σ is mapped onto a straight line $\bar k$ in $\bar\Sigma$, then the
parallel axiom holds with respect to k. The images of the parallels to k in Σ
are the Euclidean parallels to $\bar k$ in $\bar\Sigma$. All translations along k exist.


Let e be any s. l. in Σ, $\bar e$ its image in $\bar\Sigma$. To each curve in $\bar\Sigma$ which can
be deduced from $\bar e$ by a translation there corresponds an s. l. in Σ, since e can
be carried into this s. l. by a motion of Σ. Therefore we know that by any
translation of $\bar\Sigma$ the curve $\bar e$ is transformed into a curve which either coincides
with $\bar e$, or intersects $\bar e$ in one point, or is disjoint from $\bar e$. If $\bar e$ has no
intersections with one of its images without being identical with it, $\bar e$ is a straight
line. If $\bar e$ has intersections with some of its images, the fact that it has at
most one intersection with each of them implies that $\bar e$ is a convex curve
without parallel supporting lines.¹⁶ Therefore $\bar e$ has two different, well defined
asymptotes (in the Euclidean sense).

Now let us assume that there exists an s. l. $e_0$ through O in Σ whose image $\bar e_0$ is
a convex curve but not a straight line. Let $e_\varepsilon$ be an s. l. through O tending to $e_0$.
Then the image $\bar e_\varepsilon$ of $e_\varepsilon$ tends to $\bar e_0$. The angle over which the directions of the
supporting lines to $\bar e_\varepsilon$ range tends to an angle containing the corresponding angle
for $\bar e_0$.¹⁷ Since we assume that the latter is positive, for small ε the former one is
positive too and has a common interior direction with the latter. Then one can
find chords $\bar G_\varepsilon$ and $\bar G_0$ of $\bar e_\varepsilon$ respectively $\bar e_0$ which are parallel to the common
direction and have the same length (one or both of these chords may be
subarcs of $\bar e_\varepsilon$ respectively $\bar e_0$). By a translation of $\bar\Sigma$ the chord $\bar G_\varepsilon$ can be carried
into $\bar G_0$. Let $\bar e'_\varepsilon$ be the image of $\bar e_\varepsilon$ under this translation. $\bar e'_\varepsilon$ and $\bar e_0$ have two
common points; hence (since $\bar e'_\varepsilon$ is the image of an s. l. $e'_\varepsilon$ in Σ) $\bar e'_\varepsilon = \bar e_0$. We
see, then, that

(4.3) For small ε, the curve $\bar e_\varepsilon$ can be transformed into $\bar e_0$ by a
translation of $\bar\Sigma$.


We now consider all s. l. through O in Σ. The images of some of them
are straight lines. These straight lines form a closed set. We consider one,
$\bar W$, of the open angles which is bounded by two such straight lines $\bar e_1$, $\bar e_2$, but
contains none of them. From (4.3) it follows that all the s. l. e of Σ in W
are mapped onto convex curves $\bar e$ in $\bar W$, each of which can be transformed
into any other by a translation of $\bar\Sigma$. If $e \to e_1$ ($e \to e_2$), we must have
$\bar e \to \bar e_1$ ($\bar e \to \bar e_2$). Therefore, one of the Euclidean asymptotes, $\bar e'_1$, of $\bar e$ must
be parallel to $\bar e_1$ and the other must be parallel to $\bar e_2$. Since the Euclidean
distance of $\bar e'_1$ and $\bar e$ vanishes, one sees from the fact that translations of $\bar\Sigma$
correspond to motions of Σ that $r(e'_1, e) = 0$, also. According to (4.2) $e'_1$ is
an s. l. and, on account of (1.16), e is an asymptote to $e'_1$, such that $e_1$ and e
would be two different asymptotes to $e'_1$ through O, in contradiction to the
parallel axiom which holds with respect to $e'_1$ by (4.2). We have proved that all
s. l. through O are mapped onto straight lines through O. With the help of
(4.2) and (2.10) one concludes easily that the metric is Minkowskian.

The other case, where there exists neither a parallel to g nor to h, is much
easier to deal with. We prove


¹⁶ This can easily be shown. One finds a proof for this and other questions
connected with it in [10].

¹⁷ A proof for the analogous fact in a space of any dimension can be found in
[1], p. 35.


(4.4) If all translations along g and h exist, where h is not an asymptote
to g, and if through each point of Σ not on g there pass two different
asymptotes to g and through each point not on h two different asymptotes to h, then
the metric is hyperbolic.


The idea of the proof is obvious: One considers two non-intersecting s. l.
$g_{x_1}$ and $g_{x_2}$ along which all translations exist and which are not asymptotes
to each other. Then one takes a point S between these s. l. and the curves
equidistant from $g_{x_1}$ and $g_{x_2}$ through S. They will (with a proper choice of S)
intersect in a further point S'. The translation along $g_{x_1}$ carrying S into S'
followed by that along $g_{x_2}$ carrying S' into S is a rotation around S. By
keeping S fixed and varying $g_{x_2}$ continuously, one varies the rotations around
S continuously and gets in this way the full group of rotations around S. To
make this a rigorous proof one has to show that $g_{x_2}$ belongs to a continuous
family of s. l. which are not asymptotes to $g_{x_1}$ and along which all translations
exist, and, furthermore, that the rotations around S really vary when $g_{x_2}$ moves
within that family.

To carry this out in detail, we prove first that, if the s. l. g and h (of (4.4))
intersect (which according to p. 250 always can be supposed to be the case),
the transforms of g under the translations along h are not asymptotes to each
other.

Put $g \cdot h = O$. If by (O → O'), $O' \subset h$, g were transformed into the
asymptote g' to g through O', it would easily follow from p. 242 that each
translation along h would transform g into an asymptote to g. Therefore we
would have the situation of section 3. L(O, g) does not coincide with h;
otherwise, a translation (O → P') along g would transform L(O, g) into
L(P', g) and, on the other hand, into an s. l. h'. Since L(O, g) and L(P', g)
are equidistant, h and h' would be equidistant, and therefore parallel. Because
L(O, g) is different from h, a suitable asymptote $g_1$ to g would intersect
L(O, g) in a point $Q_{1,1}$ such that the segment $Q_{1,1}Q_{1,0}$ of $g_1$ between L(O, g)
and h has a length of the form $\sigma/2^n$, where σ has the same significance as in
(3.2). In order to fix the ideas, let this segment be in the exterior of L(O, g).
Let the curve $c_1$ equidistant from h and through $Q_{1,1}$ intersect g in $Q_{0,1}$. Let
us introduce the point $Q_{0,2}$ on g for which $Q_{0,1}$ is the center of $OQ_{0,2}$, the
curve $c_2$ equidistant from h and through $Q_{0,2}$, and the points $Q_{i,0}$ on $OQ_{1,0}$
such that

$$r(Q_{i,0}, O) = i\,r(Q_{1,0}, O).$$

The asymptote to g through $Q_{i,0}$ may intersect $c_1$ in $Q_{i,1}$ and $c_2$ in $Q_{i,2}$. Then
it follows from our assumption (that the asymptotes to g are the images of g
under the translations along h) that all segments $Q_{i,j}Q_{i,j+1}$ have length $\sigma/2^n$;
that L(O, g) passes through $Q_{1,1}$ and $Q_{2,2}$; and that the limit circle to g through
$Q_{i,0}$ passes through the points $Q_{i+1,1}$ and $Q_{i+2,2}$. From the signification of σ
it follows, then, that $(Q_{0,2} \to O)$ carries $Q_{1,2}$ into $Q_{1,0}$, $Q_{2,2}$ into $Q_{2,0}$ and
$Q_{3,2}$ into $Q_{3,0}$. By considering $(O \to Q_{0,2})$, one sees that $Q_{1,2}$, $Q_{2,2}$, $Q_{3,2}$ must lie on
one s. l.; hence it would follow from (2.5) to (2.7) that the curve $c_2$ must
be an s. l. parallel to h, in contradiction to the assumptions of (4.4).

Fig. 3.

We designate the transforms of g under (O → X), $X \subset h$, by $g_x$ (see
Fig. 4). Let $X_0$, $X_1$, $X_2$ be three different points on h such that $X_0$ is between
$X_1$ and $X_2$. We choose a point S on $g_{x_0}$ so that the curves $c_1$ equidistant from
$g_{x_1}$ and $c_2$ equidistant from $g_{x_2}$, through S, both cross $g_{x_0}$ and contain in their
exteriors, except for S, the same one of the two rays of $g_{x_0}$ determined by S.
This is always possible if $X_1$ is sufficiently near to $X_0$. Let $S_{x_1}$ be the other
intersection of $c_1$ and $c_2$. We consider the translations $T_1$ and $T_2$ along $g_{x_1}$
respectively $g_{x_2}$ which carry S into $S_{x_1}$. If $S_{x_1}$ is between $g_{x_0}$ and $g_{x_1}$ we form
$T_2T_1^{-1}$ (first $T_2$, then $T_1^{-1}$); if $S_{x_1}$ is between $g_{x_0}$ and $g_{x_2}$ we form $T_1T_2^{-1}$ (for
$S_{x_1} \subset g_{x_0}$ either of these motions will do). Assume the first case and put
$ST_2^{-1} = K$. By $T_2T_1^{-1}$ the point K is carried into a point $K_{x_1}$ on the side
of $g_{x_0}$ other than the one containing K. $T_2T_1^{-1}$ is a rotation with center S
different from the identity. We now make the analogous transformation with
$g_x$ instead of $g_{x_1}$, $X \subset X_0X_1$: We draw the curve equidistant from $g_x$ and
through S, which may intersect $c_2$ in $S_x$ (in addition to S). The motion of Σ
composed of the translation along $g_{x_2}$ carrying S into $S_x$ and the translation
along $g_x$ carrying $S_x$ into S may move K into $K_x$. The points $K_x$ are on the
circle K(S, K); $K_x$ depends in a continuous manner on $g_x$ and does not
remain fixed if X varies from $X_1$ to $X_0$. For, if $g_x = g_{x_0}$ (= $c_{x_0}$), the point
$S_{x_0}$ is on $c_2$ between S and $S_{x_1}$; therefore, the translation along $g_{x_2}$ moving S
into $S_{x_0}$ moves K into a point P on $c_2$ between K and S, and $(S_{x_0} \to S)$ moves
P into $K_{x_0}$, which, therefore, is between $g_{x_0}$ and $g_{x_2}$ (or on $g_{x_0}$, if $g_{x_0}$ passes
through S and $S_{x_1}$) and different from $K_{x_1}$.

Fig. 4.

Thus there exist rotations with S as center transforming K into all points
of an arc of K(S, K) leading from $K_{x_1}$ to $K_{x_0}$; hence, the full group of
rotations around S exists. Since, by suitable translations along g and h, the point
S can be moved into each point of the plane, all rotations around each point
exist. It is well known that this implies that the metric is hyperbolic.¹⁸

¹⁸ The only place where this fact is stated exactly in the form used here seems to
be [5], Anhang IV. Of course this is much too deep a result for our purpose; one
verifies easily in a direct way that the axioms of the hyperbolic plane geometry hold
(in any of the current forms).


The theorems (4.1) and (4.4) and the examples (2.11) and (3.5) give
together:

If there exist all translations along two shortest lines which are neither
parallel nor asymptotes to each other, then the metric is Desarguesian, namely,
either Minkowskian or hyperbolic. If h and g are asymptotes or parallel to
each other the metric is not necessarily Desarguesian, even if one assumes in
addition that the parallel axiom either in the Euclidean or in the hyperbolic
form holds with respect to each shortest line.


INSTITUTE FOR ADVANCED STUDY. 


REFERENCES. 


[1] Bonnesen, T. und W. Fenchel, "Theorie der konvexen Körper," Ergebnisse der
Mathematik, vol. 3 (1934), fasc. 1.

[2] Busemann, H., "Pasch'sches Axiom und Zweidimensionalität," Mathematische
Annalen, Bd. 107 (1932), pp. 324-328.

[3] Busemann, H., "Über die Geometrien, in denen die 'Kreise mit unendlichem
Radius' die kürzesten Linien sind," Mathematische Annalen, Bd. 106 (1932),
pp. 140-160.

[4] Funk, P., "Über die Geometrien, bei denen die Geraden die kürzesten Linien sind und
die Äquidistanten zu einer Geraden wieder Gerade sind," Monatshefte
für Mathematik und Physik, Bd. 37 (1930), pp. 153-158.

[5] Hilbert, D., Grundlagen der Geometrie, 7th edition, Leipzig, 1930.

[6] Liebmann, H., Nichteuklidische Geometrie, 3rd edition, Berlin, 1923.

[7] Menger, K., "Untersuchungen über allgemeine Metrik," Mathematische Annalen,
Bd. 100 (1928), pp. 75-163.

[8] Minkowski, H., Geometrie der Zahlen, Leipzig, 1910.

[9] Reinhardt, K., "Über einen Satz von Herrn H. Tietze," Jahresber. Deutsch. Mathem.
Ver., Bd. 38 (1929), pp. 191-192.

[10] Rosenthal, A., "Die Translationsordnung ebener Kurven," Monatshefte für
Mathematik und Physik, Bd. 45 (1936), pp. 76-91.

[11] Schilling, F., Projektive und nichteuklidische Geometrie, Bd. 1, Leipzig, 1931.

[12] Tietze, H., "Eine charakteristische Eigenschaft der abgeschlossenen konvexen
Punktmengen," Mathematische Annalen, Bd. 99 (1928), pp. 394-398.

[13] Veblen, O. and J. W. Young, Projective Geometry, vol. II, Boston, 1918.



